Accelerating Neural Recommendation Training with Embedding Scheduling
Offered By: USENIX via YouTube
Course Description
Overview
Explore a groundbreaking approach to accelerating distributed Deep Learning Recommendation Model (DLRM) training without compromising model accuracy. Delve into the concept of embedding scheduling, which proactively determines optimal embedding training locations and synchronization strategies. Learn about Herald, a real-time embedding scheduler designed to increase cache hit rates and decrease unnecessary updates, significantly reducing communication overhead. Discover how this innovative method leverages the predictability and infrequency of in-cache embedding accesses in distributed training systems. Examine the performance improvements achieved through adaptive location-aware input allocation and optimal communication plan generation. Gain insights into the potential for substantial reductions in embedding transmissions and notable performance enhancements in DLRM training across various network configurations.
Syllabus
NSDI '24 - Accelerating Neural Recommendation Training with Embedding Scheduling
Taught by
USENIX
Related Courses
Custom and Distributed Training with TensorFlowDeepLearning.AI via Coursera Architecting Production-ready ML Models Using Google Cloud ML Engine
Pluralsight Building End-to-end Machine Learning Workflows with Kubeflow
Pluralsight Deploying PyTorch Models in Production: PyTorch Playbook
Pluralsight Inside TensorFlow
TensorFlow via YouTube