YoVDO

Building Data Lakehouse from Scratch - End-to-End Data Engineering Project

Offered By: CodeWithYu via YouTube

Tags

Data Engineering Courses, Machine Learning Courses, Amazon Web Services (AWS) Courses, Apache Spark Courses, Apache Kafka Courses, Apache Flink Courses, Data Analytics Courses, Lambda Functions Courses, Delta Lake Courses

Course Description

Overview

Learn to design, implement, and maintain a secure, scalable, and cost-effective data lakehouse architecture in this comprehensive end-to-end data engineering project. Explore advanced techniques using Apache Spark, Apache Kafka, Apache Flink, Delta Lake, AWS, and open-source tools to unlock data's full potential through analytics and machine learning. Dive into modern system architectures, create databases, utilize Glue crawlers, and automate data orchestration with Lambda functions on AWS Cloud. Gain hands-on experience in coding, optimizing, and verifying results while mastering the intricacies of building a robust data lakehouse from scratch.

Syllabus

Introduction
The system architecture
The modern system architecture
Implementation of the Current Data Lakehouse on AWS Cloud
Creating Databases for Data Lakehouse
Using Glue crawler for Data Lakehouse
Using Lambda function to automate data orchestration on AWS Cloud
Coding the Lambda function
Optimising Lambda Function
Verification of Results
Outro
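
As a rough illustration of the syllabus items on using a Lambda function to automate orchestration and a Glue crawler to catalog data, the sketch below shows one way a Lambda handler might start a crawler when new objects land in S3. It is not taken from the course: the S3-notification trigger, the crawler name `lakehouse_crawler`, the `CRAWLER_NAME` environment variable, and the helper names are all assumptions made for illustration.

```python
import os
from urllib.parse import unquote_plus

# Hypothetical crawler name; the course's actual resource names are not known.
DEFAULT_CRAWLER = "lakehouse_crawler"


def extract_s3_objects(event):
    """Return (bucket, key) pairs found in an S3 event notification payload."""
    objects = []
    for record in event.get("Records", []):
        s3 = record.get("s3", {})
        bucket = s3.get("bucket", {}).get("name")
        key = s3.get("object", {}).get("key")
        if bucket and key:
            # Object keys are URL-encoded in S3 notifications.
            objects.append((bucket, unquote_plus(key)))
    return objects


def handle_event(event, glue_client, crawler_name=DEFAULT_CRAWLER):
    """Start the crawler only if the event references at least one object."""
    objects = extract_s3_objects(event)
    if not objects:
        return {"started": False, "objects": []}
    glue_client.start_crawler(Name=crawler_name)
    return {"started": True, "objects": objects}


def lambda_handler(event, context):
    # boto3 is imported lazily so the helpers above can be tested without AWS.
    import boto3

    crawler_name = os.environ.get("CRAWLER_NAME", DEFAULT_CRAWLER)
    return handle_event(event, boto3.client("glue"), crawler_name)
```

Passing the Glue client into `handle_event` keeps the logic testable with a stand-in object; a real deployment would also need to handle the case where the crawler is already running.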


Taught by

CodeWithYu

Related Courses

CS115x: Advanced Apache Spark for Data Science and Data Engineering (University of California, Berkeley via edX)
Big Data Analytics (University of Adelaide via edX)
Big Data Essentials: HDFS, MapReduce and Spark RDD (Yandex via Coursera)
Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames (Yandex via Coursera)
Introduction to Apache Spark and AWS (University of London International Programmes via Coursera)