New Features of Apache Spark 3.5 - In-Depth Analysis
Offered By: Databricks via YouTube
Course Description
Overview
Explore the cutting-edge features of Apache Spark 3.5 in this 33-minute talk by Daniel Tenedorio and Xiao Li from Databricks. Dive into Spark Connect's enhanced accessibility, DeepSpeed's AI efficiency integration, and performance optimizations. Learn about new PySpark and SQL capabilities, including array manipulation functions, SQL IDENTIFIER clause improvements, expanded API support, and Arrow-optimized Python UDFs. Gain insights into building scalable, efficient, and robust data-driven applications using the latest advancements in big data processing and AI. After the talk, access additional resources like the Big Book of Data Engineering and The Data Team's Guide to the Databricks Lakehouse Platform for further exploration.
Syllabus
An In Depth Look at the New Features of Apache Spark 3.5
Taught by
Databricks
Related Courses
CS115x: Advanced Apache Spark for Data Science and Data EngineeringUniversity of California, Berkeley via edX Big Data Analytics
University of Adelaide via edX Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera Introduction to Apache Spark and AWS
University of London International Programmes via Coursera