Efficiently Stream Data into Your Medallion Architecture with Apache Hudi
Offered By: The ASF via YouTube
Course Description
Overview
Discover how to efficiently stream data into a medallion architecture using Apache Hudi in this 42-minute conference talk from The ASF. Learn about the challenges of building a medallion architecture with streaming data sources and how Apache Hudi, a transactional data lake platform, addresses these issues. Explore Hudi's new record-level index for faster upsert performance and its database-style change data capture feature. Gain insights into how the record index and incremental processing work in Hudi, and understand how the CDC feature enables incremental processing on the lake. Presented by Ethan Guo, an Apache Hudi committer and Database Engineer at Onehouse, this talk provides valuable knowledge for those interested in optimizing data streaming and Lakehouse architecture.
Syllabus
A glide, skip or a jump: Efficiently stream data into your medallion architecture with Apache Hudi
Taught by
The ASF
Related Courses
Google Cloud Big Data and Machine Learning Fundamentals en EspañolGoogle Cloud via Coursera Big Data Emerging Technologies
Yonsei University via Coursera Building Resilient Streaming Systems on GCP em Português Brasileiro
Google Cloud via Coursera Building Resilient Streaming Systems on Google Cloud Platform en Español
Google Cloud via Coursera AWS Certified Data Analytics Specialty 2024 - Hands On!
Udemy