Apache Paimon Stream Data Lake: CDC Feed and Stream Read
Offered By: The ASF via YouTube
Course Description
Overview
Explore the capabilities of Apache Paimon (incubating), a cutting-edge streaming data lake storage technology, in this 36-minute conference talk by Li Jinsong, an Alibaba Senior technical specialist and PMC member of Apache Flink. Dive into the world of high-throughput, low-latency data intake, streaming subscriptions, and real-time query functionalities. Discover how Paimon's open data format and technology concept seamlessly integrate with leading computing engines like Apache Flink, Spark, and Trino. Learn about key features including CDC Schema Evolution into lake, CDC entire vault into the lake, CDC into the lake part of the column update, and real-time change log stream reading. Gain valuable insights into the future of flow lake storage technology from an industry expert who has extensive experience in distributed flow computing, distributed batch computing, and lake storage.
Syllabus
Apache Paimon Stream Data Lake: Cdc Feed Lake And Stream Read
Taught by
The ASF
Related Courses
CS115x: Advanced Apache Spark for Data Science and Data EngineeringUniversity of California, Berkeley via edX Big Data Analytics
University of Adelaide via edX Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera Introduction to Apache Spark and AWS
University of London International Programmes via Coursera