End-to-End Data Engineering on Azure Spark Cluster: Japan Visa Analysis
Offered By: CodeWithYu via YouTube
Course Description
Overview
Learn to set up a Spark master-worker architecture in a Docker container on Azure and perform end-to-end data processing and visualization of Japan visa numbers using PySpark and Plotly. Set up cloud clusters, read and clean data with PySpark, apply data transformation techniques, and create interactive visualizations with Plotly Express. Gain practical experience in data engineering, from system architecture setup to exporting visualizations and cleaned data. Follow along with step-by-step instructions, including timestamps for each section, to master cloud-based data processing and analysis techniques.
Syllabus
Introduction
Setting up the system architecture
Setting up cloud clusters
Coding
Results
Taught by
CodeWithYu
Related Courses
Intro to StatisticsStanford University via Udacity Introduction to Data Science
University of Washington via Coursera Passion Driven Statistics
Wesleyan University via Coursera Information Visualization
Indiana University via Independent DCO042 - Python For Informatics
University of Michigan via Independent