Trillion Parameter Models Are Here
Offered By: Edan Meyer via YouTube
Course Description
Overview
Explore the groundbreaking advancements in training large-scale Machine Learning models through Microsoft's ZeRO-Infinity technology in this 27-minute video. Learn how this innovation overcomes previous GPU memory limitations, enabling the training of models with trillions of parameters using modest GPU resources and fine-tuning billion-parameter models on a single GPU. Discover the implications for working with extensive models like GPT-2 and understand the technical aspects of ZeRO-Infinity, including its forward step and parallelization techniques. Delve into the results and potential applications of this technology, which promises to revolutionize deep learning training by unlocking unprecedented model scale.
Syllabus
Intro
Motivation
Paper
Forward Step
Parallelization
Results
Taught by
Edan Meyer
Related Courses
Gérez des flux de données temps réelCentraleSupélec via OpenClassrooms 現役シリコンバレーエンジニアが教えるPython 3 入門 + 応用 +アメリカのシリコンバレー流コードスタイル
Udemy Selenium WebDriver 4, Cucumber BDD, Java & More! [NEW: 2023]
Udemy Advanced Data and Stream Processing with Microsoft TPL Dataflow
Pluralsight Amazon Simple Storage Service (Amazon S3) Performance Optimization (German)
Amazon Web Services via AWS Skill Builder