A Crash Course In PySpark
Offered By: Udemy
Course Description
Overview
Learn all the fundamentals of PySpark
What you'll learn:
What you'll learn:
- PySpark, Apache Spark, Big Data Analytics, Big Data Processing, Python
Spark is one of the most in-demand Big Data processing frameworks right now.
This course will take you through the core concepts of PySpark. We will work to enable you to do most of the things you’d do in SQL or Python Pandas library, that is:
Getting hold of data
Handling missing data and cleaning data up
Aggregating your data
Filtering it
Pivoting it
And Writing it back
All of these things will enable you to leverage Spark on large datasets and start getting value from your data.
Let’s get started.
Taught by
Kieran Keene
Related Courses
CS115x: Advanced Apache Spark for Data Science and Data EngineeringUniversity of California, Berkeley via edX Big Data Analytics
University of Adelaide via edX Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera Introduction to Apache Spark and AWS
University of London International Programmes via Coursera