YoVDO

EnvPipe - Performance-Preserving DNN Training Framework for Saving Energy

Offered By: USENIX via YouTube

Tags

USENIX Annual Technical Conference Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a conference talk from USENIX ATC '23 that introduces EnvPipe, an innovative DNN training framework designed to save energy without compromising performance. Learn how EnvPipe maximizes energy efficiency in multi-GPU DNN training by leveraging slack time created by pipeline parallelism bubbles. Discover the framework's approach to stretching execution time of pipeline units through SM frequency reduction, while maintaining the original accuracy of training tasks. Gain insights into EnvPipe's implementation as a PyTorch library and its impressive energy-saving results: up to 25.2% in single-node training with 4 GPUs and 28.4% in multi-node training with 16 GPUs, all while keeping performance degradation below 1%. Understand the significance of this research in addressing energy consumption challenges in data centers, particularly for DNN training and inference services.

Syllabus

USENIX ATC '23 - EnvPipe: Performance-preserving DNN Training Framework for Saving Energy


Taught by

USENIX

Related Courses

Amazon DynamoDB - A Scalable, Predictably Performant, and Fully Managed NoSQL Database Service
USENIX via YouTube
Faasm - Lightweight Isolation for Efficient Stateful Serverless Computing
USENIX via YouTube
AC-Key - Adaptive Caching for LSM-based Key-Value Stores
USENIX via YouTube
The Future of the Past - Challenges in Archival Storage
USENIX via YouTube
A Decentralized Blockchain with High Throughput and Fast Confirmation
USENIX via YouTube