Constructing the 10x Efficiency of Cloud-Native AI Infrastructure
Offered By: CNCF [Cloud Native Computing Foundation] via YouTube
Course Description
Overview
Explore the challenges and solutions for constructing highly efficient cloud-native AI infrastructure in this 35-minute conference talk by Peter Pan and 秋萍 戴 from DaoCloud. Learn how to maximize GPU utilization, unify multi-architecture accelerators, manage organizational quotas and costs, implement resource isolation, and optimize scheduling in AI clouds built on bare-metal infrastructure. Discover strategies for sharing GPU clusters between virtual machines and containers, leveraging high-speed networks, and orchestrating datasets effectively. Gain insights from the speakers' experiences in building AI clouds for IDC and internal use, drawing on open-source stacks from the Linux Foundation and CNCF. Understand how to overcome common obstacles and achieve a tenfold increase in efficiency for cloud-native AI infrastructure.
Syllabus
Constructing the 10x Efficiency of Cloud-Native AI Infrastructure - Peter Pan, DaoCloud & 秋萍 戴
Taught by
CNCF [Cloud Native Computing Foundation]
Related Courses
High Performance ComputingGeorgia Institute of Technology via Udacity Введение в параллельное программирование с использованием OpenMP и MPI
Tomsk State University via Coursera High Performance Computing in the Cloud
Dublin City University via FutureLearn Production Machine Learning Systems
Google Cloud via Coursera LAFF-On Programming for High Performance
The University of Texas at Austin via edX