Apache Spark Structured APIs Using Databricks
Offered By: NashKnolX via YouTube
Course Description
Overview
Learn about Apache Spark Structured APIs and their application in data manipulation using Databricks in this 41-minute tutorial. Explore key concepts including Spark's evolution, RDD implementation, DataFrame API, and database functionality. Gain hands-on experience with projection, filtering, aggregation, and data visualization techniques. Discover the essentials of ETL operations and understand how to effectively maneuver various data types using Spark Structured APIs in a Databricks environment.
Syllabus
Introduction
Agenda
What is Spark
Spark Timeline
What is RDD
RDD Implementation
Stretching
DataFrame API
Projection and Filter
Aggregation
Data Set
Data Set Visualization
Data Aggregation
What is Database
Database Functionality
Database Walkthrough
ETL operation
Taught by
NashKnolX
Related Courses
CS115x: Advanced Apache Spark for Data Science and Data EngineeringUniversity of California, Berkeley via edX Big Data Analytics
University of Adelaide via edX Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera Introduction to Apache Spark and AWS
University of London International Programmes via Coursera