The Challenges of Building AI Infrastructure on Virtualization
Offered By: KVM Forum via YouTube
Course Description
Overview
Explore the challenges of building AI infrastructure on virtualized environments, and the solutions proposed for them, in this KVM Forum presentation by ByteDance virtualization experts Xin He and Hao Hong. Delve into the complexities of cloud computing for AI applications, focusing on heterogeneous computing and its unique requirements. Examine two key issues: the performance degradation that the IOMMU causes in PCIe P2P communication between GPUs, or between GPUs and RDMA NICs, and the limitations of traditional PMU virtualization for high-precision monitoring. Discover the proposed solutions, including techniques to avoid redirecting P2P TLPs to the IOMMU and methods for passing core and uncore PMUs through to guest systems. Learn how these approaches aim to narrow the gap between virtualized and bare-metal environments for AI infrastructure.
Syllabus
The Challenges of building AI Infra on virtualization by Xin He & Hao Hong
Taught by
KVM Forum
Related Courses
Software as a Service (University of California, Berkeley via Coursera)
Software Defined Networking (Georgia Institute of Technology via Coursera)
Pattern-Oriented Software Architectures: Programming Mobile Services for Android Handheld Systems (Vanderbilt University via Coursera)
Web-Technologien (openHPI)
Données et services numériques, dans le nuage et ailleurs (Certificat informatique et internet via France Université Numerique)