YoVDO

Turing-NLG, DeepSpeed and the ZeRO Optimizer

Offered By: Yannic Kilcher via YouTube

Tags

Artificial Intelligence Courses Deep Learning Courses Parallel Computing Courses Model Optimization Courses

Course Description

Overview

Explore Microsoft's groundbreaking 17-billion parameter language model and the innovative ZeRO optimizer in this informative video. Dive into the technical details of how ZeRO enables efficient model and data parallelism without sacrificing training speed. Learn about the Turing-NLG model's state-of-the-art perplexity achievements and the DeepSpeed framework that powers it. Gain insights into the latest advancements in large-scale language model training and optimization techniques that are pushing the boundaries of natural language processing.

Syllabus

Turing-NLG, DeepSpeed and the ZeRO optimizer


Taught by

Yannic Kilcher

Related Courses

Intro to Parallel Programming
Nvidia via Udacity
Introduction to Linear Models and Matrix Algebra
Harvard University via edX
Введение в параллельное программирование с использованием OpenMP и MPI
Tomsk State University via Coursera
Supercomputing
Partnership for Advanced Computing in Europe via FutureLearn
Fundamentals of Parallelism on Intel Architecture
Intel via Coursera