Distributed Training - Part I - Lecture 17
Offered By: MIT HAN Lab via YouTube
Course Description
Overview
Explore distributed training techniques in machine learning with this comprehensive lecture from MIT's 6.5940 course. Delve into the first part of distributed training, led by Professor Song Han, as part of the EfficientML.ai series. Learn about the fundamental concepts, challenges, and strategies for scaling machine learning models across multiple devices or nodes. Gain insights into parallel processing, data parallelism, and model parallelism techniques that enable training of large-scale models efficiently. Understand the importance of distributed training in modern AI applications and its impact on accelerating the development of complex neural networks. Access accompanying slides at efficientml.ai to enhance your learning experience and follow along with the lecture content.
Syllabus
EfficientML.ai Lecture 17: Distributed Training (Part I) (MIT 6.5940, Fall 2023, Zoom)
Taught by
MIT HAN Lab
Related Courses
Intro to Parallel ProgrammingNvidia via Udacity Introduction to Linear Models and Matrix Algebra
Harvard University via edX Введение в параллельное программирование с использованием OpenMP и MPI
Tomsk State University via Coursera Supercomputing
Partnership for Advanced Computing in Europe via FutureLearn Fundamentals of Parallelism on Intel Architecture
Intel via Coursera