YoVDO

Delta Lake: Optimizing Merge Operations

Offered By: Databricks via YouTube

Tags

Delta Lake Courses Big Data Courses Data Management Courses Performance Tuning Courses Data Engineering Courses

Course Description

Overview

Dive into the intricacies of Delta Lake's merge operation in this 24-minute talk from Databricks. Explore the underlying mechanics of merge, learn optimization techniques, and gain insights through code snippets and sample configurations. Understand the basics of merge, including inner and full outer joins, and discover practical tips for handling large merges. Examine partition and file pruning examples, operation metrics, and best practices for S3 bucket usage. Enhance your knowledge of Delta Lake and improve your data management skills with this informative session from the Databricks Summit Europe.

Syllabus

SUMMIT EUROPE
Merge overview
Merge basics • Tale of two joins: inner join and full outer join
Partition Prune Example
File Prune Example
Operation Metrics continued
Large merge tips s3 bucket: write at the root-53 parallelism is defined by the Each large table should have its own s3 bucket and anoth
Final recap
Feedback


Taught by

Databricks

Related Courses

Web Intelligence and Big Data
Indian Institute of Technology Delhi via Coursera
Big Data for Better Performance
Open2Study
Big Data and Education
Columbia University via edX
Big Data Analytics in Healthcare
Georgia Institute of Technology via Udacity
Data Mining with Weka
University of Waikato via Independent