YoVDO

Realtime Data Streaming - End-to-End Data Engineering Project

Offered By: CodeWithYu via YouTube

Tags

Data Engineering Courses, Docker Courses, Apache Spark Courses, Apache Airflow Courses, Apache Kafka Courses, Apache ZooKeeper Courses, ETL Pipelines Courses, Kafka Connect Courses

Course Description

Overview

Build a real-time data streaming pipeline in this end-to-end data engineering project video. Learn to set up a data pipeline with Apache Airflow, stream data with Kafka and Kafka Connect, use ZooKeeper for distributed synchronization, process data with Apache Spark, and store it in Cassandra and PostgreSQL. The data engineering environment is containerized with Docker. The instructor guides you through each phase, from data ingestion to processing and storage, covering system architecture, API data retrieval, Docker Compose setup, and streaming data into Kafka and Cassandra. Suited to aspiring data engineers and to professionals who want to build scalable, real-time data processing systems.
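As a rough illustration of the ingestion phase described above (retrieve a record from an API, flatten it, and hand it to a message stream), here is a minimal Python sketch. It is not the instructor's code: the field names, the functions format_record, serialize, and run_pipeline, and the sample record are all hypothetical, and the publish callable is a stand-in for whatever Kafka producer the project actually uses.

```python
import json
import uuid
from typing import Any, Callable, Dict, List


def format_record(raw: Dict[str, Any]) -> Dict[str, Any]:
    """Flatten a nested API response into a flat row (hypothetical fields)."""
    name = raw.get("name", {})
    return {
        "id": str(uuid.uuid4()),  # surrogate key generated at ingestion time
        "first_name": name.get("first", ""),
        "last_name": name.get("last", ""),
        "email": raw.get("email", ""),
    }


def serialize(record: Dict[str, Any]) -> bytes:
    """Encode a record as UTF-8 JSON bytes, a common message payload format."""
    return json.dumps(record).encode("utf-8")


def run_pipeline(
    fetch: Callable[[], Dict[str, Any]],
    publish: Callable[[bytes], None],
    count: int,
) -> int:
    """Fetch `count` records, format each one, and pass it to `publish`."""
    sent = 0
    for _ in range(count):
        publish(serialize(format_record(fetch())))
        sent += 1
    return sent


if __name__ == "__main__":
    # Stand-ins (assumptions): a canned record instead of an HTTP call,
    # and a plain list instead of a message broker.
    sample = {"name": {"first": "Ada", "last": "Lovelace"}, "email": "ada@example.com"}
    outbox: List[bytes] = []
    run_pipeline(lambda: sample, outbox.append, count=3)
    print(len(outbox), "messages queued")
```

Passing fetch and publish as callables keeps the sketch runnable without a network or broker; in a real setup they would wrap an HTTP request and a producer's send call, and a scheduler such as Airflow would trigger run_pipeline.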

Syllabus

Introduction
System architecture
Getting data from API with Airflow
Docker Compose for the architecture
Streaming data into Kafka
Apache Spark and Cassandra setup
Streaming data into Cassandra
Outro


Taught by

CodeWithYu

Related Courses

Developing Distributed Applications Using ZooKeeper (Cognitive Class)
Distributed Systems & Cloud Computing with Java (Udemy)
A Gentle Introduction to Event Streaming and Processing Using Apache Pulsar (All Things Open via YouTube)
A ZooKeeper Layer for FoundationDB - Paul Hemberger, HubSpot (Linux Foundation via YouTube)
Event Streaming for the Best of All Worlds (Spring I/O via YouTube)