Introducing the New Python Data Source API for Apache Spark
Offered By: Databricks via YouTube
Course Description
Overview
Explore the groundbreaking Python Data Source API for Apache Spark™ in this 27-minute conference talk by Databricks. Discover how this new API simplifies big data processing for Python developers, eliminating the need for Scala knowledge when integrating custom data sources into Spark. Learn about key features, including streamlined reading and writing operations, and understand how this innovation makes the big data ecosystem more accessible. Gain insights from Databricks engineers Allison Wang and Ryan Nienhuis, along with a customer co-presenter, as they discuss the API's impact on expanding Spark's reach within the Python community. Delve into additional resources like the Big Book of Data Engineering and The Data Team's Guide to the Databricks Lakehouse Platform to further enhance your understanding of data engineering concepts.
Syllabus
Introducing the New Python Data Source API for Apache Spark™
Taught by
Databricks
Related Courses
内存数据库管理openHPI CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX Processing Big Data with Azure Data Lake Analytics
Microsoft via edX Google Cloud Big Data and Machine Learning Fundamentals en Español
Google Cloud via Coursera Google Cloud Big Data and Machine Learning Fundamentals 日本語版
Google Cloud via Coursera