YoVDO

Impala Performance on Iceberg Tables - Optimizations and Benchmarks

Offered By: The ASF via YouTube

Tags

Big Data Courses
Cloudera Courses
Data Storage Courses
Apache Iceberg Courses

Course Description

Overview

Explore the performance of Apache Impala on Iceberg tables in this 27-minute conference talk from The ASF. The talk covers the implementation details of Impala's optimized C++ approach to reading Iceberg tables, in contrast to other engines that rely on the Iceberg library. Discover how Impala handles delete files in Iceberg tables efficiently by implementing new Iceberg-specific operators that improve query performance. Gain insights into Impala's architecture, Iceberg's structure, and the performance enhancements designed specifically for the Iceberg integration. Detailed measurements compare Impala's performance against other open-source query engines. By the end, you will have a high-level understanding of both the Impala and Iceberg architectures, along with Impala's competitive edge in querying Iceberg tables that have position delete files.

Syllabus

Let's see how fast Impala runs on Iceberg

Taught by

The ASF

Related Courses

Big Data: from Data to Decisions
Queensland University of Technology via FutureLearn
Big Data: adquisición y almacenamiento de datos
Universitat Autònoma de Barcelona (Autonomous University of Barcelona) via Coursera
Practical Guide to setup Hadoop and Spark Cluster using CDH
Udemy
Cloudera Hadoop Administration
YouTube
Cloudera Data Platform
YouTube