YoVDO

Introducing the New Python Data Source API for Apache Spark

Offered By: Databricks via YouTube

Tags

Apache Spark Courses Python Courses Databricks Courses Data Engineering Courses Data Integration Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the groundbreaking Python Data Source API for Apache Spark™ in this 27-minute conference talk by Databricks. Discover how this new API simplifies big data processing for Python developers, eliminating the need for Scala knowledge when integrating custom data sources into Spark. Learn about key features, including streamlined reading and writing operations, and understand how this innovation makes the big data ecosystem more accessible. Gain insights from Databricks engineers Allison Wang and Ryan Nienhuis, along with a customer co-presenter, as they discuss the API's impact on expanding Spark's reach within the Python community. Delve into additional resources like the Big Book of Data Engineering and The Data Team's Guide to the Databricks Lakehouse Platform to further enhance your understanding of data engineering concepts.

Syllabus

Introducing the New Python Data Source API for Apache Spark™


Taught by

Databricks

Related Courses

Web sémantique et Web de données
Inria (French Institute for Research in Computer Science and Automation) via France Université Numerique
Linked Data Engineering
openHPI
Implementing ETL with SQL Server Integration Services
Microsoft via edX
Advanced Manufacturing Enterprise
University at Buffalo via Coursera
Big Data Services: Capstone Project
Yandex via Coursera