Master Databricks and Apache Spark - SparkR Usage - Lesson 35
Offered By: Bryan Cafferky via YouTube
Course Description
Overview
Explore the integration of R programming with Databricks and Apache Spark using the SparkR package in this comprehensive tutorial. Dive into the architecture, Apache Arrow, and the SparkR API while learning about open-source libraries and example code. Master essential techniques such as masking, SparkSQL, and SparkR SQL. Discover how to create local dataframes, print schemas, and perform data transformations using piping, mutate, and aggregate functions. Practice DataFrame operations, including joins and merges, and tackle a coding challenge to reinforce your newly acquired skills.
Syllabus
Intro
Architecture
Apache Arrow
What is SparkR
SparkR API
Open Source Libraries
Example Code
Masking
SparkSQL
SparkR SQL
SparkR Display
Masking Objects
Creating a Local Dataframe
SparkR Print Schema
SparkR Piping
Transform
Mutate
DataFrame
Aggregate Functions
DataFrame Operations
Join
Merge
Challenge
Code
Notebook
Taught by
Bryan Cafferky
Related Courses
Social Network AnalysisUniversity of Michigan via Coursera Intro to Algorithms
Udacity Data Analysis
Johns Hopkins University via Coursera Computing for Data Analysis
Johns Hopkins University via Coursera Health in Numbers: Quantitative Methods in Clinical & Public Health Research
Harvard University via edX