New Features of Apache Spark 3.5 - In-Depth Analysis
Offered By: Databricks via YouTube
Course Description
Overview
Explore the cutting-edge features of Apache Spark 3.5 in this 33-minute talk by Daniel Tenedorio and Xiao Li from Databricks. Dive into Spark Connect's enhanced accessibility, DeepSpeed's AI efficiency integration, and performance optimizations. Learn about new PySpark and SQL capabilities, including array manipulation functions, SQL IDENTIFIER clause improvements, expanded API support, and Arrow-optimized Python UDFs. Gain insights into building scalable, efficient, and robust data-driven applications using the latest advancements in big data processing and AI. After the talk, access additional resources like the Big Book of Data Engineering and The Data Team's Guide to the Databricks Lakehouse Platform for further exploration.
Syllabus
An In Depth Look at the New Features of Apache Spark 3.5
Taught by
Databricks
Related Courses
Fundamentals of Scalable Data ScienceIBM via Coursera Data Science and Engineering with Spark
Berkeley University of California via edX Master of Machine Learning and Data Science
Imperial College London via Coursera Data Analysis Using Pyspark
Coursera Project Network via Coursera Building Machine Learning Pipelines in PySpark MLlib
Coursera Project Network via Coursera