YoVDO

The Practice and Optimization of Data Lake Iceberg in Xiaomi

Offered By: The ASF via YouTube

Tags

Data Lakes Courses
Data Encryption Courses
Parquet Courses
Apache Iceberg Courses

Course Description

Overview

Explore the implementation and optimization of Apache Iceberg data lake technology at Xiaomi in this 23-minute conference talk. Learn why Xiaomi adopted Iceberg and where it stands in production today. Discover how Xiaomi upgraded its business architecture using Iceberg, including enhancing Iceberg's Parquet file filtering capabilities and evolving its managed table optimization services. Gain insights into how Iceberg reads data, the development of the Parquet Page Index feature, and the integration of Parquet encryption for column-level data security. Understand the challenges Xiaomi faced before implementing managed table optimization services and the system architecture that resulted. Finally, get a glimpse of Xiaomi's future plans for Iceberg, including index construction, hybrid cloud storage architecture, an intelligent data lake warehouse, and cache acceleration strategies.

Syllabus

The Practice And Optimization Of Data Lake Iceberg In Xiaomi


Taught by

The ASF

Related Courses

Python for Data Science Tips, Tricks, & Techniques
LinkedIn Learning
Sound Data Engineering in Rust - From Bits to DataFrames
Databricks via YouTube
Recent Parquet Improvements in Apache Spark - Vectorized Complex Types and Column Index Support
Databricks via YouTube
Optimizing Spark SQL Jobs with Parallel and Asynchronous IO
Databricks via YouTube
Degrading Performance - Understanding and Solving Small Files Syndrome
Databricks via YouTube