Next-Generation Networks for Machine Learning
Offered By: Scalable Parallel Computing Lab, SPCL @ ETH Zurich via YouTube
Course Description
Overview
Explore cutting-edge techniques for accelerating distributed deep neural network (DNN) training in this 50-minute conference talk by Manya Ghobadi at SPCL_Bcast. Delve into the challenges posed by increasing dataset and model sizes, and discover innovative solutions to overcome network bottlenecks in datacenter environments. Learn about a novel optical fabric that optimizes network topology and parallelization strategies for DNN clusters. Examine the limitations of fair-sharing in congestion control algorithms and understand a new scheduling approach that strategically places jobs on network links to enhance performance. Gain insights into the future of machine learning infrastructure and network design for improved training efficiency.
Syllabus
Introduction
Talk
Announcements
Taught by
Scalable Parallel Computing Lab, SPCL @ ETH Zurich
Related Courses
An Introduction to Computer Networks (Stanford University via Independent)
Computer Networks and the Internet (Kiron via edX)
IT Support: Networking Essentials (Microsoft via edX)
Digital Switching - I (Indian Institute of Technology Kanpur via Swayam)
How To Build a Network Topology Using GNS3 (Coursera Project Network via Coursera)