Master Databricks and Apache Spark - SparkR Usage - Lesson 35
Offered By: Bryan Cafferky via YouTube
Course Description
Overview
Explore how R integrates with Databricks and Apache Spark through the SparkR package. This tutorial covers the architecture, Apache Arrow, and the SparkR API, along with open-source libraries and example code. It addresses masking, SparkSQL, and SparkR SQL, then shows how to create local dataframes, print schemas, and transform data using piping, mutate, and aggregate functions. Practice DataFrame operations, including joins and merges, and finish with a coding challenge to reinforce what you have learned.
Syllabus
Intro
Architecture
Apache Arrow
What is SparkR
SparkR API
Open Source Libraries
Example Code
Masking
SparkSQL
SparkR SQL
SparkR Display
Masking Objects
Creating a Local Dataframe
SparkR Print Schema
SparkR Piping
Transform
Mutate
DataFrame
Aggregate Functions
DataFrame Operations
Join
Merge
Challenge
Code
Notebook
Taught by
Bryan Cafferky
Related Courses
Data Processing with Azure (LearnQuest via Coursera)
Mejores prácticas para el procesamiento de datos en Big Data (Coursera Project Network via Coursera)
Data Science with Databricks for Data Analysts (Databricks via Coursera)
Azure Data Engineer con Databricks y Azure Data Factory (Coursera Project Network via Coursera)
Curso Completo de Spark con Databricks (Big Data) (Coursera Project Network via Coursera)