Benchmarks and How-Tos for Convolutional Neural Networks on HorovodRunner-Enabled Apache Spark Clusters
Offered By: Databricks via YouTube
Course Description
Overview
Explore benchmarks and implementation techniques for Convolutional Neural Networks using HorovodRunner on Apache Spark clusters in this 25-minute Databricks talk. Learn about the scaling efficiency of Horovod Runner for CNN-based tasks on both GPU and CPU clusters, and discover the first-time implementation of the Rectified Adam optimizer. Gain insights into cluster settings, distributed model retrieval, training time measurement, Rectified Adam usage, and factors affecting scaling efficiency. Understand how smaller companies can leverage distributed deep learning for competitive advantages against tech giants through fast iterations on accessible Spark clusters.
Syllabus
Introduction
Why a Distributed Deep Learning System
Data Parallelization
Benchmarks
HorovodRunner Demo
Benchmarking
Graph
Ubers
Applet
Other Models
Summary
Taught by
Databricks
Related Courses
Challenges and Opportunities in Applying Machine Learning - Alex Jaimes - ODSC East 2018Open Data Science via YouTube Efficient Distributed Deep Learning Using MXNet
Simons Institute via YouTube SHADE - Enable Fundamental Cacheability for Distributed Deep Learning Training
USENIX via YouTube Alpa - Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning
USENIX via YouTube Horovod - Distributed Deep Learning for Reliable MLOps
Linux Foundation via YouTube