YoVDO

Unity - Accelerating DNN Training Through Joint Optimization of Algebraic Transformations and Parallelization

Offered By: USENIX via YouTube

Tags

OSDI (Operating Systems Design and Implementation) Courses Formal Verification Courses Parallelization Courses

Course Description

Overview

Explore a conference talk from OSDI '22 that introduces Unity, a groundbreaking system for optimizing distributed Deep Neural Network (DNN) training. Delve into how Unity jointly optimizes algebraic transformations and parallelization using a unified parallel computation graph (PCG). Learn about the system's innovative approach to automatically generating and verifying optimizations, as well as its hierarchical search algorithm for maintaining scalability. Discover Unity's performance improvements over existing DNN training frameworks, with evaluations conducted on seven real-world DNNs using up to 192 GPUs across 32 nodes. Gain insights into the potential impact of Unity on accelerating DNN training and its availability as part of the open-source FlexFlow framework.

Syllabus

Introduction
Unitys Goal
Parallelization
Parallel Computation Graph
Data Parallelization
PCG Advantages
Techniques
Results
Conclusion


Taught by

USENIX

Related Courses

SPARK 2014
AdaCore via Independent
Automated Reasoning: Symbolic Model Checking
EIT Digital via Coursera
Software Testing and Verification
University System of Maryland via edX
Haskell for Imperative Programmers
YouTube
Model Checking and Temporal Logic - E. Allen Emerson's Turing Award Lecture
Association for Computing Machinery (ACM) via YouTube