Master Databricks and Apache Spark - SparkR Usage - Lesson 35
Offered By: Bryan Cafferky via YouTube
Course Description
Overview
Explore how R integrates with Databricks and Apache Spark through the SparkR package. This tutorial covers the architecture, Apache Arrow, and the SparkR API, along with open-source libraries and example code. It explains masking, SparkSQL, and SparkR SQL, then shows how to create local dataframes, print schemas, and transform data using piping, mutate, and aggregate functions. It closes with DataFrame operations, including joins and merges, and a coding challenge to reinforce the material.
Syllabus
Intro
Architecture
Apache Arrow
What is SparkR
SparkR API
Open Source Libraries
Example Code
Masking
SparkSQL
SparkR SQL
SparkR Display
Masking Objects
Creating a Local Dataframe
SparkR Print Schema
SparkR Piping
Transform
Mutate
DataFrame
Aggregate Functions
DataFrame Operations
Join
Merge
Challenge
Code
Notebook
Taught by
Bryan Cafferky
Related Courses
Web Intelligence and Big Data (Indian Institute of Technology Delhi via Coursera)
Big Data for Better Performance (Open2Study)
Big Data and Education (Columbia University via edX)
Big Data Analytics in Healthcare (Georgia Institute of Technology via Udacity)
Data Mining with Weka (University of Waikato via Independent)