YoVDO

The Practice and Optimization of Data Lake Iceberg in Xiaomi

Offered By: The ASF via YouTube

Tags

Data Lakes Courses
Data Encryption Courses
Parquet Courses
Apache Iceberg Courses

Course Description

Overview

Explore the implementation and optimization of Apache Iceberg data lake technology at Xiaomi in this 23-minute conference talk. Learn why Xiaomi adopted Iceberg and where it stands in production today. Discover how Xiaomi upgraded its business architecture with Iceberg, including improvements to Iceberg's Parquet file filtering and the evolution of its managed table optimization service. Gain insights into how Iceberg reads data, the development of the Parquet Page Index feature, and the integration of Parquet encryption for column-level data security. Understand the challenges Xiaomi faced before introducing the managed table optimization service and the system architecture that resulted. Finally, get a glimpse of Xiaomi's future plans for Iceberg, including index construction, a hybrid cloud storage architecture, an intelligent data lake warehouse, and cache acceleration strategies.

Syllabus

The Practice And Optimization Of Data Lake Iceberg In Xiaomi


Taught by

The ASF

Related Courses

Building Modern Data Streaming Apps with Open Source
Linux Foundation via YouTube
How to Stabilize a GenAI-First Modern Data LakeHouse - Provisioning 20,000 Ephemeral Data Lakes per Year
CNCF [Cloud Native Computing Foundation] via YouTube
Data Storage and Queries
DeepLearning.AI via Coursera
Delivering Portability to Open Data Lakes with Delta Lake UniForm
Databricks via YouTube
Fast Copy-On-Write in Apache Parquet for Data Lakehouse Upserts
Databricks via YouTube