Deep Dive into GPU Support in Apache Spark 3.x - Accelerator-Aware Scheduling and RAPIDS Plugin
Offered By: Databricks via YouTube
Course Description
Overview
This 47-minute video from Databricks takes a deep dive into GPU support in Apache Spark 3.x. It covers accelerator-aware task scheduling, columnar data processing support, fractional scheduling, and stage-level resource scheduling and configuration. It then introduces the RAPIDS plugin for Apache Spark 3.x, which enables GPU acceleration without code changes, and explains how the Catalyst optimizer's physical plan is modified for GPU-aware scheduling and how the plugin uses the RAPIDS libraries. The talk also describes optimizations to the shuffle plugin that use UCX for GPU memory communication, and previews future optimizations involving RDMA and GPUDirect Storage. Performance results are presented for industry-standard benchmarks and real-world production datasets. Topics include GPU scheduling, discovery scripts, the GPU assignments API, the GPU scheduling UI, stage-level scheduling, SQL columnar processing, Project Hydrogen, Deep Learning Recommendation Machines, and accelerated shuffle results.
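As a rough illustration of the accelerator-aware and fractional scheduling settings mentioned above, the following sketch builds the relevant Spark properties as a plain Python dictionary and turns them into `spark-submit` arguments. The property names are standard Spark 3.x and RAPIDS plugin settings. The helper names `gpu_conf` and `to_submit_args` and the discovery script path are hypothetical and chosen only for this example, and the plugin setting assumes the RAPIDS Accelerator jar is already on the classpath.

```python
# Sketch only: builds Spark properties as a dict; it does not start Spark.

def gpu_conf(gpus_per_executor: int, tasks_per_gpu: int, discovery_script: str) -> dict:
    """Return Spark properties for GPU-aware scheduling.

    tasks_per_gpu > 1 yields a fractional per-task GPU amount, which lets
    several tasks share one GPU.
    """
    if gpus_per_executor < 1 or tasks_per_gpu < 1:
        raise ValueError("gpus_per_executor and tasks_per_gpu must be >= 1")
    # A whole GPU per task is written as "1"; otherwise use the fraction.
    task_amount = "1" if tasks_per_gpu == 1 else str(1 / tasks_per_gpu)
    return {
        # Number of GPUs each executor requests from the cluster manager.
        "spark.executor.resource.gpu.amount": str(gpus_per_executor),
        # Share of a GPU that each task needs.
        "spark.task.resource.gpu.amount": task_amount,
        # Script the executor runs to report the GPU addresses it can see.
        "spark.executor.resource.gpu.discoveryScript": discovery_script,
        # Enables the RAPIDS Accelerator SQL plugin.
        "spark.plugins": "com.nvidia.spark.SQLPlugin",
    }


def to_submit_args(conf: dict) -> list:
    """Flatten the properties into '--conf key=value' arguments."""
    args = []
    for key, value in conf.items():
        args += ["--conf", f"{key}={value}"]
    return args


# Hypothetical script path; four tasks share each GPU.
conf = gpu_conf(1, 4, "/opt/spark/getGpusResources.sh")
print(" ".join(to_submit_args(conf)))
```

With one GPU per executor and four tasks per GPU, each task requests 0.25 of a GPU, so up to four tasks can be scheduled on the same device at once.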
Syllabus
Intro
Accelerator-Aware Scheduling
GPU Scheduling Example
GPU Discovery Script Example
GPU Assignments API
GPU Scheduling UI
Stage Level Scheduling
SQL Columnar Processing
Spark 3 with Project Hydrogen
Deep Learning Recommendation Machines
RAPIDS Accelerator for Apache Spark (Plugin)
Is This a Silver Bullet?
But It Can Be Amazing
Spark SQL & DataFrame Compilation Flow
ETL Technology Stack
Demo Cluster Setup
Databricks Demo Results
T4 Cluster Setup
RAPIDS Accelerator on AWS
Spark Shuffle
Accelerated Shuffle Results
What's Next
Taught by
Databricks