Apache Spark 3 - Databricks Certified Associate Developer
Offered By: Udemy
Course Description
Overview
What you'll learn:
- How to prepare for the Databricks Certified Associate Developer For Apache Spark 3 Certification Exam
- The Architecture of an Apache Spark Application
- Learn how Apache Spark runs on a cluster of computer
- Learn the Execution Hierarchy of Apache Spark
- Create DataFrame from files and Scala Collections
- Spark DataFrame API and SQL functions
- Learn the different techniques to select the columns of a DataFrame
- How to define the schema of a DataFrame and set the data types of the columns
- Apply various methods to manipulate the columns of a DataFrame
- How to filter your DataFrame based on specifics rules
- Learn how to sort data in a specific order
- Learn how to sort rows of a DataFrame in a specific order
- How to arrange the rows of DataFrame as groups
- How to handle NULL Values in a DataFrame
- How to use JOIN or UNION to combine two data sets
- How you can save the result of complex data transformations to an external storage system
- The different deployment modes of an Apache Spark Application
- working with UDFs and Spark SQL functions
- How to use Databricks Community Edition to write Apache Spark Code
Do you want to learn how to handle massive amounts of data at scale?
Learn Apache Spark 3 and pass the Databricks Certified Associate Developer for Apache Spark 3.0
Hi, My name is Wadson, and I’m a Databricks Certified Associate Developer for Apache Spark.
Apache Spark has become the standard big-data cluster processing framework in today's data-driven world.
Apache Spark is used for Data Engineering, Data Science, and Machine Learning.
I will teach you everything you need to know about starting with Apache Spark.
You will learn the Architecture of Apache Spark and use its Core APIs to manipulate complex data.
You will write queries to perform transformations such as Join, Union, GroupBy, and more.
This course is for beginners.
You don't need any previous knowledge of Apache Spark.
Notebooks are available to download so that you can follow along with me in the videos.
The Notebooks contain all the source code I use in the course.
There are also Quizzes to help you assess your understanding of the topics.
Check Out some of the top reviews and enroll in the course.
"This course is really helpful with all the necessary details needed for the Certification: Databricks Certified Associate Developer for Apache Spark 3.0.
I've cleared the certification with 80% score and I'd suggest to check all the Course contents thoroughly"
"Very good course. Gives a good overview of all the necessary components of the spark application which are required for the test and that too in very short span of time. will highly recommend this course.
worth spending time !!"
Taught by
Wadson Guimatsa
Related Courses
Big DataUniversity of Adelaide via edX Advanced Data Science with IBM
IBM via Coursera Analysing Unstructured Data using MongoDB and PySpark
Coursera Project Network via Coursera Apache Spark for Data Engineering and Machine Learning
IBM via edX Apache Spark (TM) SQL for Data Analysts
Databricks via Coursera