YoVDO

Spark SQL Shuffle Join Optimization at eBay

Offered By: The ASF via YouTube

Tags

Apache Spark Courses, SQL Courses, Data Warehousing Courses, JOIN Operations Courses, eBay Courses

Course Description

Overview

Explore a 15-minute conference talk on Spark SQL shuffle join improvements implemented at eBay. Wang Yuming, an eBay software engineer and Apache Spark PMC member, presents a series of optimizations for the shuffle join, one of the most expensive and widely used operations in data warehouses. Learn about three key enhancements: unwrapping join conditions so that bucket joins can be used, extending shuffle exchange reuse to reduce table scans, and pushing partial aggregation down through joins. Gain insights into SQL query performance optimization from an expert in Apache Spark development and a 2022 SIGMOD Systems Award winner.
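
To give a feel for the third enhancement, here is a minimal plain-Python sketch of the general idea behind pushing partial aggregation through a join. It is not the eBay or Spark implementation. The tables `fact` and `dim`, the column layout, and both function names are hypothetical and chosen only for illustration. The sketch checks that summing per join key before the join gives the same grouped totals as joining first, while feeding fewer rows into the join.

```python
from collections import defaultdict

def join_then_aggregate(fact, dim):
    """Baseline plan: join every fact row to matching dim rows, then SUM by category."""
    totals = defaultdict(int)
    for key, amount in fact:
        for dim_key, category in dim:
            if key == dim_key:
                totals[category] += amount
    return dict(totals)

def aggregate_then_join(fact, dim):
    """Pushed-down plan: partially SUM fact rows per join key, join, then finish the SUM."""
    # Partial aggregation below the join: one row per distinct join key.
    partial = defaultdict(int)
    for key, amount in fact:
        partial[key] += amount
    # Join the (smaller) partial result with dim and complete the aggregation.
    totals = defaultdict(int)
    for dim_key, category in dim:
        if dim_key in partial:
            totals[category] += partial[dim_key]
    return dict(totals)

# Hypothetical sample data: (join_key, amount) and (join_key, category).
fact = [(1, 10), (1, 5), (2, 7), (3, 1), (2, 3)]
dim = [(1, "books"), (2, "toys"), (4, "games")]

baseline = join_then_aggregate(fact, dim)
pushed_down = aggregate_then_join(fact, dim)
print(baseline == pushed_down)  # both plans produce the same grouped totals
```

In a distributed engine the benefit would come from shuffling one pre-aggregated row per key instead of every raw row; the sketch only demonstrates that the rewrite preserves the result for a SUM.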

Syllabus

Spark SQL Shuffle Join Improvement at eBay


Taught by

The ASF

Related Courses

Introduction to Databases
Meta via Coursera
Web Development
Udacity
Introduction to Data Science
University of Washington via Coursera
Datenmanagement mit SQL
openHPI
Sabermetrics 101: Introduction to Baseball Analytics
Boston University via edX