PySpark with Python
Offered By: YouTube
Course Description
Overview
Dive into the world of big data processing with this comprehensive tutorial series on PySpark and Python. Begin with an introduction to PySpark and its installation, then progress through hands-on lessons on DataFrame operations, including handling missing values and performing filter operations. Explore advanced topics such as GroupBy and aggregate functions, and gain an introduction to PySpark MLlib for machine learning applications. Delve into the mathematical intuition behind linear regression for data science, and learn how to use Databricks for PySpark development. Conclude with a practical implementation of multiple linear regression in Databricks, equipping you with essential skills for large-scale data processing and analysis.
Syllabus
Tutorial 1-Pyspark With Python-Pyspark Introduction and Installation.
Tutorial 2-Pyspark With Python-Pyspark DataFrames- Part 1.
Tutorial 3- Pyspark With Python-Pyspark DataFrames- Handling Missing Values.
Tutorial 4- Pyspark With Python-Pyspark DataFrames- Filter Operations.
Tutorial 5- Pyspark With Python-GroupBy And Aggregate Functions.
Tutorial 6- Pyspark With Python-Introduction To Pyspark Mlib.
Tutorial 26- Linear Regression Indepth Maths Intuition- Data Science.
Tutorial 7- Pyspark With Python|Introduction To Databricks.
Tutorial 8- Pyspark Multiple Linear Regression Implementation In Databricks.
Taught by
Krish Naik
Related Courses
CS115x: Advanced Apache Spark for Data Science and Data EngineeringUniversity of California, Berkeley via edX Big Data Analytics
University of Adelaide via edX Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera Introduction to Apache Spark and AWS
University of London International Programmes via Coursera