Streaming ETL on the Shoulders of Giants
Offered By: Devoxx via YouTube
Course Description
Overview
Explore the world of streaming ETL in this 46-minute Devoxx conference talk. Dive into the importance of bridging the gap between data in motion and data at rest, with a focus on Apache Kafka as a central nervous system for company-wide data architectures. Learn about building robust data integration pipelines between MongoDB and Apache Kafka using the Kafka Connect framework. Discover configuration-based data in motion scenarios and streaming ETL pipeline examples that can be implemented without coding. Gain insights into topics such as the diminishing value of data, Data Fabric, Kafka APIs, Source Connectors, and achieving a Single Source of Truth. Understand how to synchronize data across services and see practical examples of Source Connector implementation.
Syllabus
Introduction
The diminishing value of data
Data Fabric
Kafka
Kafka API
Kafka Connect
Source Connectors
Single Source of Truth
Synchronize Data Across Services
Source Connector
Example
Taught by
Devoxx
Related Courses
Web sémantique et Web de donnéesInria (French Institute for Research in Computer Science and Automation) via France Université Numerique Linked Data Engineering
openHPI Implementing ETL with SQL Server Integration Services
Microsoft via edX Advanced Manufacturing Enterprise
University at Buffalo via Coursera Big Data Services: Capstone Project
Yandex via Coursera