YoVDO

Machine Learning Infrastructure at Meta Scale

Offered By: MLOps World: Machine Learning in Production via YouTube

Tags

Machine Learning Courses PyTorch Courses MLOps Courses Recommendation Systems Courses Scalability Courses Distributed Training Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the challenges and solutions in scaling machine learning infrastructure at Meta in this 37-minute conference talk from MLOps World: Machine Learning in Production. Gain insights from Shivam Bharuka, Senior AI Infra Engineer at Meta, as he shares his experience in supporting large-scale ranking and recommendation models serving over a billion users. Discover how Meta reimagined its entire AI Infrastructure stack to accommodate rapidly growing machine learning models. Learn about the development of specialized hardware using powerful GPUs and network devices, as well as the design of optimized distributed training algorithms using PyTorch. Understand the approach taken to redesign and scale the stack, addressing performance, reliability, and efficiency concerns in machine learning training infrastructure.

Syllabus

Machine Learning Infrastructure at Meta Scale


Taught by

MLOps World: Machine Learning in Production

Related Courses

Financial Sustainability: The Numbers side of Social Enterprise
+Acumen via NovoEd
Cloud Computing Concepts: Part 2
University of Illinois at Urbana-Champaign via Coursera
Developing Repeatable ModelsĀ® to Scale Your Impact
+Acumen via Independent
Managing Microsoft Windows Server Active Directory Domain Services
Microsoft via edX
Introduction aux conteneurs
Microsoft Virtual Academy via OpenClassrooms