YoVDO

End-to-End Data Engineering on Azure Spark Cluster: Japan Visa Analysis

Offered By: CodeWithYu via YouTube

Tags

PySpark Courses Data Visualization Courses Cloud Computing Courses Docker Courses Data Engineering Courses Plotly Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Learn to set up a Spark master-worker architecture in a Docker container on Azure and perform end-to-end data processing and visualization of Japan visa numbers using PySpark and Plotly. Set up cloud clusters, read and clean data with PySpark, apply data transformation techniques, and create interactive visualizations with Plotly Express. Gain practical experience in data engineering, from system architecture setup to exporting visualizations and cleaned data. Follow along with step-by-step instructions, including timestamps for each section, to master cloud-based data processing and analysis techniques.

Syllabus

Introduction
Setting up the system architecture
Setting up cloud clusters
Coding
Results


Taught by

CodeWithYu

Related Courses

内存数据库管理
openHPI
CS115x: Advanced Apache Spark for Data Science and Data Engineering
University of California, Berkeley via edX
Processing Big Data with Azure Data Lake Analytics
Microsoft via edX
Google Cloud Big Data and Machine Learning Fundamentals en Español
Google Cloud via Coursera
Google Cloud Big Data and Machine Learning Fundamentals 日本語版
Google Cloud via Coursera