The Practice and Optimization of Data Lake Iceberg in Xiaomi
Offered By: The ASF via YouTube
Course Description
Overview
Explore the implementation and optimization of Apache Iceberg data lake technology at Xiaomi in this 23-minute conference talk. Learn about Xiaomi's reasons for adopting Iceberg and its current production status. Discover how Xiaomi upgraded its business architecture using Iceberg, including enhancing Iceberg Parquet file filtering capabilities and evolving managed table optimization services. Gain insights into Iceberg's reading principles, the development of Parquet Page Index feature, and the integration of Parquet encryption for column-level data security. Understand the challenges faced before implementing managed table optimization services and the resulting system architecture. Finally, get a glimpse of Xiaomi's future plans for Iceberg, including index construction, hybrid cloud storage architecture, intelligent data lake warehouse, and cache acceleration strategies.
Syllabus
The Practice And Optimization Of Data Lake Iceberg In Xiaomi
Taught by
The ASF
Related Courses
Python for Data Science Tips, Tricks, & TechniquesLinkedIn Learning Sound Data Engineering in Rust - From Bits to DataFrames
Databricks via YouTube Recent Parquet Improvements in Apache Spark - Vectorized Complex Types and Column Index Support
Databricks via YouTube Optimizing Spark SQL Jobs with Parallel and Asynchronous IO
Databricks via YouTube Degrading Performance - Understanding and Solving Small Files Syndrome
Databricks via YouTube