Spark SQL Shuffle Join Optimization at eBay
Offered By: The ASF via YouTube
Course Description
Overview
Explore a 15-minute conference talk on Spark SQL Shuffle Join improvements implemented at eBay. Discover how Wang Yuming, an eBay software engineer and Apache Spark PMC Member, presents a series of optimizations for one of the most expensive and widely used operations in data warehouses. Learn about three key enhancements: unwrapping join conditions to utilize bucket joins, enhancing shuffle exchange reuse to minimize table scans, and pushing down partial aggregation through joins. Gain insights into SQL query performance optimization techniques from an expert in Apache Spark development and a 2022 SIGMOD Systems Award winner.
Syllabus
Spark Sql Shuffle Join Improvement At Ebay
Taught by
The ASF
Related Courses
Introduction to DatabasesMeta via Coursera Web Development
Udacity Introduction to Data Science
University of Washington via Coursera Datenmanagement mit SQL
openHPI Sabermetrics 101: Introduction to Baseball Analytics
Boston University via edX