YoVDO

Apache Iceberg Planning Explained - Efficient Job Execution for Large-Scale Data

Offered By: The ASF via YouTube

Tags

Big Data Courses Data Warehousing Courses Data Lakes Courses File Management Courses Scalability Courses Distributed Computing Courses Apache Iceberg Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the efficient job planning process in Apache Iceberg through this 22-minute conference talk. Gain insights into how the table format rapidly identifies relevant files for queries, even with massive data volumes. Learn about the hybrid strategy that seamlessly transitions between local and distributed execution for optimal performance. Discover the design of Apache Iceberg metadata and its role in effective job execution. Benefit from this talk whether considering Apache Iceberg adoption or seeking to optimize existing production environments. Presented by Anton Okolnychyi, an Apache Iceberg Committer and PMC Member, as well as an Apache Spark contributor at Apple.

Syllabus

Apache Iceberg Planning Explained


Taught by

The ASF

Related Courses

Building Modern Data Streaming Apps with Open Source
Linux Foundation via YouTube
How to Stabilize a GenAI-First Modern Data LakeHouse - Provisioning 20,000 Ephemeral Data Lakes per Year
CNCF [Cloud Native Computing Foundation] via YouTube
Data Storage and Queries
DeepLearning.AI via Coursera
Delivering Portability to Open Data Lakes with Delta Lake UniForm
Databricks via YouTube
Fast Copy-On-Write in Apache Parquet for Data Lakehouse Upserts
Databricks via YouTube