YoVDO

Running Large-Scale Machine Learning on Google Kubernetes Engine

Offered By: Google Cloud Tech via YouTube

Tags

Machine Learning Courses Cloud Computing Courses GPU Computing Courses Scalability Courses Distributed Computing Courses Infrastructure Management Courses Containerization Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Discover how to run large-scale machine learning on Google Kubernetes Engine (GKE) in this 29-minute conference talk from Google Cloud Next. Learn about the challenges of training and serving large language models (LLMs) at scale, and explore how companies leverage GKE to build extensive ML models. Gain insights into GKE's solutions for common challenges in training large AI models, and uncover best practices for training and serving large-scale ML models on the platform. Speakers Nathan Beach and Alex Zakonov discuss GKE investments, ML training enablement, A3 VM, job queuing, GKE infrastructure, and key features that make GKE an ideal choice for large-scale machine learning operations.

Syllabus

Introduction
Challenges
Why customers use GKE
GKE Investments
How GKE enables ML training
A3 VM
Job queuing
GKE infrastructure
GKE features


Taught by

Google Cloud Tech

Related Courses

Financial Sustainability: The Numbers side of Social Enterprise
+Acumen via NovoEd
Cloud Computing Concepts: Part 2
University of Illinois at Urbana-Champaign via Coursera
Developing Repeatable ModelsĀ® to Scale Your Impact
+Acumen via Independent
Managing Microsoft Windows Server Active Directory Domain Services
Microsoft via edX
Introduction aux conteneurs
Microsoft Virtual Academy via OpenClassrooms