YoVDO

Spark RAPIDS ML - GPU Accelerated Distributed Machine Learning in Spark Clusters

Offered By: Databricks via YouTube

Tags

GPU Acceleration Courses Machine Learning Courses Apache Spark Courses Distributed Computing Courses MLlib Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Discover the power of GPU acceleration for distributed machine learning in Spark clusters through this 38-minute conference talk by Erik Ordentlich and Jinfeng Li from NVIDIA. Learn about Spark RAPIDS ML, an open-source Python package that enables GPU acceleration of Spark distributed machine learning applications. Explore how this package, built upon the RAPIDS cuML library, implements GPU-accelerated versions of classical ML algorithms for regression, classification, clustering, and dimensionality reduction. Understand the benefits of Spark RAPIDS ML, including its compatibility with Spark MLlib DataFrame API and impressive benchmark results showing up to 100x speedup and 50x cost savings over baseline Spark MLlib in compute-intensive scenarios. Gain insights into the evolution of Spark MLlib and how Spark RAPIDS ML leverages modern computing accelerators like GPUs to enhance performance.

Syllabus

Spark RAPIDS ML: GPU Accelerated Distributed ML in Spark Clusters


Taught by

Databricks

Related Courses

Cloud Computing Concepts, Part 1
University of Illinois at Urbana-Champaign via Coursera
Cloud Computing Concepts: Part 2
University of Illinois at Urbana-Champaign via Coursera
Reliable Distributed Algorithms - Part 1
KTH Royal Institute of Technology via edX
Introduction to Apache Spark and AWS
University of London International Programmes via Coursera
Réalisez des calculs distribués sur des données massives
CentraleSupélec via OpenClassrooms