YoVDO

Building Data Lakehouse from Scratch - End-to-End Data Engineering Project

Offered By: CodeWithYu via YouTube

Tags

Data Engineering Courses Machine Learning Courses Amazon Web Services (AWS) Courses Apache Spark Courses Apache Kafka Courses Apache Flink Courses Data Analytics Courses Lambda Functions Courses Delta Lake Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Learn to design, implement, and maintain a secure, scalable, and cost-effective data lakehouse architecture in this comprehensive end-to-end data engineering project. Explore advanced techniques using Apache Spark, Apache Kafka, Apache Flink, Delta Lake, AWS, and open-source tools to unlock data's full potential through analytics and machine learning. Dive into modern system architectures, create databases, utilize Glue crawlers, and automate data orchestration with Lambda functions on AWS Cloud. Gain hands-on experience in coding, optimizing, and verifying results while mastering the intricacies of building a robust data lakehouse from scratch.

Syllabus

Introduction
The system architecture
The modern system architecture
Implementation of the Current Data Lakehouse on AWS Cloud
Creating Databases for Data Lakehouse
Using Glue crawler for Data Lakehouse
Using Lambda function to automate data orchestration on AWS Cloud
Coding the Lambda function
Optimising Lambda Function
Verification of Results
Outro


Taught by

CodeWithYu

Related Courses

Understanding China, 1700-2000: A Data Analytic Approach, Part 1
The Hong Kong University of Science and Technology via Coursera
The Analytics Edge
Massachusetts Institute of Technology via edX
大数据与信息传播 Big Data and Information Dissemination
Fudan University via Coursera
The Future of Fashion
Marist College via Independent
The Mobile Consumer
Marist College via Independent