YoVDO

Pandas on Spark - Simplicity of Pandas with Efficiency of Spark

Offered By: Databricks via YouTube

Tags

Data Science Courses Big Data Courses Machine Learning Courses Python Courses SQL Courses Apache Spark Courses pandas Courses Data Processing Courses Distributed Computing Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a 30-minute talk by Databricks experts Matthew Powers and Xinrong Meng on Pandas API on Spark, a powerful solution that combines the simplicity of pandas with the scalability of Apache Spark. Learn how this tool addresses the limitations of traditional pandas by enabling distributed data processing for large datasets. Discover how to get started with Pandas on Spark and adapt existing pandas code to handle massive data volumes efficiently. Gain insights into leveraging SQL and machine learning capabilities for enhanced data analysis and processing. Perfect for data scientists and analysts looking to scale their Python-based data workflows without sacrificing the familiar pandas interface.

Syllabus

Pandas on Spark: Simplicity of Pandas with Efficiency of Spark


Taught by

Databricks

Related Courses

Introduction to Artificial Intelligence
Stanford University via Udacity
Natural Language Processing
Columbia University via Coursera
Probabilistic Graphical Models 1: Representation
Stanford University via Coursera
Computer Vision: The Fundamentals
University of California, Berkeley via Coursera
Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent