The Practice and Optimization of Data Lake Iceberg in Xiaomi
Offered By: The ASF via YouTube
Course Description
Overview
Explore the implementation and optimization of Apache Iceberg data lake technology at Xiaomi in this 23-minute conference talk. Learn why Xiaomi adopted Iceberg and how it is currently used in production. Discover how Xiaomi upgraded its business architecture with Iceberg, including enhancements to Iceberg's Parquet file filtering and the evolution of its managed table optimization service. Gain insights into how Iceberg reads data, the development of the Parquet Page Index feature, and the integration of Parquet encryption for column-level data security. Understand the challenges Xiaomi faced before introducing the managed table optimization service and the system architecture that resulted. Finally, get a glimpse of Xiaomi's future plans for Iceberg, including index construction, a hybrid cloud storage architecture, an intelligent data lake warehouse, and cache acceleration strategies.
Syllabus
The Practice And Optimization Of Data Lake Iceberg In Xiaomi
Taught by
The ASF
Related Courses
Building Modern Data Streaming Apps with Open Source (Linux Foundation via YouTube)
How to Stabilize a GenAI-First Modern Data LakeHouse - Provisioning 20,000 Ephemeral Data Lakes per Year (CNCF [Cloud Native Computing Foundation] via YouTube)
Data Storage and Queries (DeepLearning.AI via Coursera)
Delivering Portability to Open Data Lakes with Delta Lake UniForm (Databricks via YouTube)
Fast Copy-On-Write in Apache Parquet for Data Lakehouse Upserts (Databricks via YouTube)