YoVDO

Real-Time Data Integration Practice Based on Flink CDC at Alibaba Cloud

Offered By: The ASF via YouTube

Tags

Apache Flink Courses, SQL Courses, Data Lakes Courses, Data Streaming Courses, Alibaba Cloud Courses

Course Description

Overview

Explore real-time data integration practices using Flink CDC at Alibaba Cloud in this 25-minute conference talk. The talk covers the core design and key implementation of Flink CDC, with a focus on the new features introduced in version 2.4.0. Learn about the technical advantages of Flink CDC, including integrated full and incremental reading, lock-free reading, concurrent reading, and a distributed architecture. See how Flink CDC lets you join, aggregate, and flatten database data in real time using SQL. Gain insight into Alibaba Cloud's internal Flink CDC solutions for specific business challenges, such as data lake and warehouse integration scenarios and binlog expiration issues. Understand how processed data can be written to downstream systems such as Kafka, Hudi, Iceberg, and Doris for real-time data lake and warehouse integration.

Syllabus

Real-Time Data Integration Practice Based on Flink CDC at Alibaba Cloud


Taught by

The ASF

Related Courses

Developing Stream Processing Applications with AWS Kinesis
Pluralsight
Conceptualizing the Processing Model for the AWS Kinesis Data Analytics Service
Pluralsight
Processing Streaming Data Using Apache Flink
Pluralsight
Complex Event Processing Using Apache Flink
Pluralsight