Scaling Kubernetes Clusters for Generative Models - Managing GPU Resources for AI Applications

Offered By: Linux Foundation via YouTube

Tags

Kubernetes Courses, Generative AI Courses, Scalability Courses, Container Orchestration Courses

Course Description

Overview

Explore techniques for efficiently scaling Generative AI workloads on Kubernetes in this 32-minute talk by Jack Min Ong from Jina AI. Delve into the challenges of GPU resource management and learn how to use Kubernetes together with the NVIDIA GPU Operator to configure and consume GPU resources at scale. Discover various methods for sharding GPU devices and optimizing GPU usage in generative model pipelines. Gain an understanding of how to provision GPU resources and share them across multiple containers, so you can get more from your GPU investments and accelerate Generative AI applications.

Syllabus

Scaling Kubernetes Clusters for Generative Models: Managing GPU Resources for AI App... Jack Min Ong


Taught by

Linux Foundation

Related Courses

Building and Managing Superior Skills
State University of New York via Coursera
ChatGPT et IA : mode d'emploi pour managers et RH
CNAM via France Université Numerique
Digital Skills: Artificial Intelligence
Accenture via FutureLearn
AI Foundations for Everyone
IBM via Coursera
Design a Feminist Chatbot
Institute of Coding via FutureLearn