The Practice and Optimization of Data Lake Iceberg in Xiaomi
Offered By: The ASF via YouTube
Course Description
Overview
Explore the implementation and optimization of Apache Iceberg data lake technology at Xiaomi in this 23-minute conference talk. Learn about Xiaomi's reasons for adopting Iceberg and its current production status. Discover how Xiaomi upgraded its business architecture using Iceberg, including enhancing Iceberg Parquet file filtering capabilities and evolving managed table optimization services. Gain insights into Iceberg's reading principles, the development of Parquet Page Index feature, and the integration of Parquet encryption for column-level data security. Understand the challenges faced before implementing managed table optimization services and the resulting system architecture. Finally, get a glimpse of Xiaomi's future plans for Iceberg, including index construction, hybrid cloud storage architecture, intelligent data lake warehouse, and cache acceleration strategies.
Syllabus
The Practice And Optimization Of Data Lake Iceberg In Xiaomi
Taught by
The ASF
Related Courses
Data Lakes for Big DataEdCast Distributed Computing with Spark SQL
University of California, Davis via Coursera Modernizing Data Lakes and Data Warehouses with Google Cloud
Google Cloud via Coursera Data Engineering with AWS
Udacity Preparing for Google Cloud Certification: Cloud Data Engineer
Google Cloud via Coursera