YoVDO

Realtime Streaming with Data Lakehouse - End-to-End Data Engineering Project

Offered By: CodeWithYu via YouTube

Tags

Apache Kafka Courses Amazon Web Services (AWS) Courses Apache Spark Courses Apache Flink Courses Data Engineering Courses Delta Lake Courses Minio Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Learn to design, implement, and maintain secure, scalable, and cost-effective lakehouse architectures in this comprehensive video tutorial. Explore advanced techniques using Apache Spark, Apache Kafka, Apache Flink, Delta Lake, AWS, and open-source tools to unlock data's full potential through analytics and machine learning. Follow step-by-step instructions to set up a Kafka Broker in KRaft mode, configure Minio, produce data into Kafka, acquire S3 access credentials, create an S3 Bucket Event Listener for the lakehouse, and preview the resulting data. Gain practical insights into real-time streaming and data engineering best practices for building robust, scalable data solutions.

Syllabus

Setting up Kafka Broker in KRaft Mode
Setting up Minio
Producing data into Kafka
Acquiring Secret and Access Key for S3
Creating S3 Bucket Event Listener for Lakehouse
Data Preview and Results
Outro


Taught by

CodeWithYu

Related Courses

Object Storage Driven Machine Learning Workloads
Linux Foundation via YouTube
Writing Machine Learning Pipelines Against Object Storage
Linux Foundation via YouTube
Introduction to KubeFlow: Using and Use Cases
Linux Foundation via YouTube
Building a Cloud Native Storage Service - Dropbox Example
Linux Foundation via YouTube
The Fallacies of Distributed Computing
Gopher Academy via YouTube