YoVDO

Unlocking Heterogeneous AI Infrastructure in Kubernetes Clusters - Leveraging HAMi

Offered By: CNCF [Cloud Native Computing Foundation] via YouTube

Tags

Kubernetes Courses Artificial Intelligence Courses Heterogeneous Computing Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the challenges and solutions for managing heterogeneous AI infrastructure in Kubernetes clusters through this informative conference talk. Delve into the HAMi project, designed to address the complexities of integrating diverse AI devices such as NVIDIA, Intel, and Huawei Ascend. Learn how to improve resource utilization, implement unified scheduling and observability, and enhance GPU sharing capabilities. Discover flexible scheduling strategies for GPUs, including NUMA affinity/anti-affinity and binpack/spread options. Gain insights into integrating HAMi with other projects like Volcano and scheduler-plugin. Examine real-world case studies from production-level users and discuss ongoing challenges and future roadmap for heterogeneous AI infrastructure management in Kubernetes environments.

Syllabus

Unlocking Heterogeneous AI Infrastructure K8s Cluster: Leveraging the Power of HAMi - Xiao Zhang


Taught by

CNCF [Cloud Native Computing Foundation]

Related Courses

Introduction to Artificial Intelligence
Stanford University via Udacity
Probabilistic Graphical Models 1: Representation
Stanford University via Coursera
Artificial Intelligence for Robotics
Stanford University via Udacity
Computer Vision: The Fundamentals
University of California, Berkeley via Coursera
Learning from Data (Introductory Machine Learning course)
California Institute of Technology via Independent