Apache Impala: Reading, Modifying, and Optimizing Iceberg Tables
Offered By: The ASF via YouTube
Course Description
Overview
Discover how Apache Impala has evolved to meet modern data warehouse requirements in this 26-minute conference talk from The Apache Software Foundation. Learn about Impala's new capabilities for reading, modifying, and optimizing Apache Iceberg tables, including row-level modifications and table maintenance features. Explore how Impala now supports RDBMS-like row-level operations, such as removing and updating individual records, which help meet GDPR and CCPA requirements. Understand how the OPTIMIZE statement merges small data files and eliminates delete files to keep tables healthy. Gain insights into the DROP PARTITION statement, which removes selected partitions based on predicates. Presented by Cloudera engineers Zoltán Borók-Nagy, Péter Rózsa, and Noémi Pap-Takács, this talk demonstrates how Impala has adapted to emerging requirements while keeping its focus on performance in distributed, massively parallel query execution for big data.
Syllabus
This Impala not only reads, but modifies and optimizes Iceberg tables
Taught by
The ASF
Related Courses
Cloud Computing Law: Data Protection and Cybersecurity, Queen Mary University of London via Coursera
Data Privacy and Protection Standards, Coursera Instructor Network via Coursera
Ethics and AI: A Philosophical Guide to Responsible Use, Northeastern University via Coursera
Information Technologies and Security in Healthcare Settings, (ISC)² via Coursera
Cybersecurity Compliance and Framework, IBM via edX