Architecture Analysis for ETL Processing - CPU vs GPU
Offered By: Databricks via YouTube
Course Description
Overview
Explore a 19-minute conference talk on GPU acceleration for batch ETL operations. Compare CPU and GPU architectures, including memory subsystems, and analyze database operations like joins, aggregations, and data compression. Discover why these operations are well-suited for GPU acceleration, potentially achieving up to 10x speedup. Examine industry-standard benchmark queries demonstrating full end-to-end SQL query acceleration using GPUs in a prototype query engine, comparing results to existing CPU solutions. Review performance of the same queries at a 3TB scale using the RAPIDS Accelerator for Apache Sparkā¢, enabling GPU acceleration without code changes. Presented by Jason Lowe, Distinguished System Software Engineer at NVIDIA, and Nikolay Sakharnykh, Senior AI Developer Technology Manager at NVIDIA.
Syllabus
Architecture Analysis for ETL Processing: CPU vs GPU
Taught by
Databricks
Related Courses
CS115x: Advanced Apache Spark for Data Science and Data EngineeringUniversity of California, Berkeley via edX Big Data Analytics
University of Adelaide via edX Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera Introduction to Apache Spark and AWS
University of London International Programmes via Coursera