YoVDO

Efficiently Streaming Data into Medallion Architecture with Apache Hudi

Offered By: Confluent via YouTube

Tags

Data Lakes Courses Data Streaming Courses Medallion Architecture Courses Apache Hudi Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Discover how to efficiently stream data into your Medallion Architecture using Apache Hudi in this 42-minute conference talk from Confluent. Learn about the challenges of building a low-latency medallion architecture and how Apache Hudi, a transactional data lake platform, can address these issues. Explore the power of Hudi's new record-level index for faster upserts, and its database-style change data capture feature for improved incremental processing. Gain insights into efficiently graduating raw data through bronze, silver, and gold tables while avoiding computationally expensive operations. Walk away with knowledge on overcoming current technology limitations in lakehouses, understanding Hudi's record index and incremental updates, and leveraging new features to unlock efficient data processing on the lake. Perfect for data engineers and architects looking to optimize their data streaming and processing workflows in a medallion architecture.

Syllabus

A Glide, Skip or a Jump: Efficiently Stream Data into Your Medallion Architecture with Apache Hudi


Taught by

Confluent

Related Courses

History and Evolution of Data Lake Architecture - Post Lambda Architecture
Linux Foundation via YouTube
Transform Your Machine Learning Pipelines with Apache Hudi
Linux Foundation via YouTube
Delivering Portability to Open Data Lakes with Delta Lake UniForm
Databricks via YouTube
Fast Copy-On-Write in Apache Parquet for Data Lakehouse Upserts
Databricks via YouTube
Apache XTable - Interoperability Among Lakehouse Table Formats
Databricks via YouTube