Apache Iceberg Planning Explained - Efficient Job Execution for Large-Scale Data
Offered By: The ASF via YouTube
Course Description
Overview
Explore the efficient job planning process in Apache Iceberg through this 22-minute conference talk. Gain insights into how the table format rapidly identifies relevant files for queries, even with massive data volumes. Learn about the hybrid strategy that seamlessly transitions between local and distributed execution for optimal performance. Discover the design of Apache Iceberg metadata and its role in effective job execution. Benefit from this talk whether considering Apache Iceberg adoption or seeking to optimize existing production environments. Presented by Anton Okolnychyi, an Apache Iceberg Committer and PMC Member, as well as an Apache Spark contributor at Apple.
Syllabus
Apache Iceberg Planning Explained
Taught by
The ASF
Related Courses
Building Modern Data Streaming Apps with Open SourceLinux Foundation via YouTube How to Stabilize a GenAI-First Modern Data LakeHouse - Provisioning 20,000 Ephemeral Data Lakes per Year
CNCF [Cloud Native Computing Foundation] via YouTube Data Storage and Queries
DeepLearning.AI via Coursera Delivering Portability to Open Data Lakes with Delta Lake UniForm
Databricks via YouTube Fast Copy-On-Write in Apache Parquet for Data Lakehouse Upserts
Databricks via YouTube