YoVDO

Accelerating Neural Recommendation Training with Embedding Scheduling

Offered By: USENIX via YouTube

Tags

Distributed Training Courses Neural Networks Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a groundbreaking approach to accelerating distributed Deep Learning Recommendation Model (DLRM) training without compromising model accuracy. Delve into the concept of embedding scheduling, which proactively determines optimal embedding training locations and synchronization strategies. Learn about Herald, a real-time embedding scheduler designed to increase cache hit rates and decrease unnecessary updates, significantly reducing communication overhead. Discover how this innovative method leverages the predictability and infrequency of in-cache embedding accesses in distributed training systems. Examine the performance improvements achieved through adaptive location-aware input allocation and optimal communication plan generation. Gain insights into the potential for substantial reductions in embedding transmissions and notable performance enhancements in DLRM training across various network configurations.

Syllabus

NSDI '24 - Accelerating Neural Recommendation Training with Embedding Scheduling


Taught by

USENIX

Related Courses

Custom and Distributed Training with TensorFlow
DeepLearning.AI via Coursera
Architecting Production-ready ML Models Using Google Cloud ML Engine
Pluralsight
Building End-to-end Machine Learning Workflows with Kubeflow
Pluralsight
Deploying PyTorch Models in Production: PyTorch Playbook
Pluralsight
Inside TensorFlow
TensorFlow via YouTube