YoVDO

Unlocking Heterogeneous AI Infrastructure K8s Cluster - Leveraging the Power of HAMi

Offered By: CNCF [Cloud Native Computing Foundation] via YouTube

Tags

Kubernetes Courses Artificial Intelligence Courses Heterogeneous Computing Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the challenges and solutions for managing heterogeneous AI infrastructure in Kubernetes clusters through this comprehensive conference talk. Dive into the HAMi project, designed to address the complexities of integrating diverse AI devices like NVIDIA, Intel, and Huawei Ascend. Learn about unified scheduling, observability, and strategies to improve resource utilization of expensive AI hardware. Discover techniques for GPU sharing, ensuring QoS for high-priority tasks, and implementing flexible scheduling policies. Gain insights from real-world case studies and explore integrations with other projects such as Volcano and scheduler-plugin. Understand the current challenges and future roadmap for optimizing heterogeneous AI device management in Kubernetes environments.

Syllabus

Unlocking Heterogeneous AI Infrastructure K8s Cluster: Leveraging the...- Xiao Zhang & Mengxuan Li


Taught by

CNCF [Cloud Native Computing Foundation]

Related Courses

Future of Computing - IBM Power 9 and beyond
openHPI
SIGCOMM 2020 - Reducto - On-Camera Filtering for Resource-Efficient Real-Time Video Analytics
Association for Computing Machinery (ACM) via YouTube
Offload Annotations - Bringing Heterogeneous Computing to Existing Libraries and Workloads
USENIX via YouTube
Supercomputing Spotlights - Supercomputing Software for Moore and Beyond
Society for Industrial and Applied Mathematics via YouTube
Liquid Metal - Taming Heterogeneity
GOTO Conferences via YouTube