YoVDO

Realtime Streaming with Data Lakehouse - End-to-End Data Engineering Project

Offered By: CodeWithYu via YouTube

Tags

Apache Kafka Courses Amazon Web Services (AWS) Courses Apache Spark Courses Apache Flink Courses Data Engineering Courses Delta Lake Courses Minio Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Learn to design, implement, and maintain secure, scalable, and cost-effective lakehouse architectures in this comprehensive video tutorial. Explore advanced techniques using Apache Spark, Apache Kafka, Apache Flink, Delta Lake, AWS, and open-source tools to unlock data's full potential through analytics and machine learning. Follow step-by-step instructions to set up a Kafka Broker in KRaft mode, configure Minio, produce data into Kafka, acquire S3 access credentials, create an S3 Bucket Event Listener for the lakehouse, and preview the resulting data. Gain practical insights into real-time streaming and data engineering best practices for building robust, scalable data solutions.

Syllabus

Setting up Kafka Broker in KRaft Mode
Setting up Minio
Producing data into Kafka
Acquiring Secret and Access Key for S3
Creating S3 Bucket Event Listener for Lakehouse
Data Preview and Results
Outro


Taught by

CodeWithYu

Related Courses

Distributed Computing with Spark SQL
University of California, Davis via Coursera
Apache Spark (TM) SQL for Data Analysts
Databricks via Coursera
Building Your First ETL Pipeline Using Azure Databricks
Pluralsight
Implement a data lakehouse analytics solution with Azure Databricks
Microsoft via Microsoft Learn
Perform data science with Azure Databricks
Microsoft via Microsoft Learn