Ray at Scale - Apple's Approach to Elastic GPU Management
Offered By: Anyscale via YouTube
Course Description
Overview
Explore Apple's innovative approach to elastic GPU management for scaling AI/ML workloads in this 31-minute conference talk from Ray Summit 2024. Discover how Weiwei Yang and Abin Shahab tackle common challenges like GPU fragmentation, low utilization, and compromised SLAs by building a multi-tenancy ready platform based on Ray. Learn about their queuing and GPU quota management system powered by Apache YuniKorn, and gain insights into advanced techniques for achieving resource fairness, GPU preemption, and gang scheduling across diverse Ray workloads. Gain valuable knowledge for optimizing GPU resource management and enhancing the scalability and efficiency of AI/ML operations in large-scale environments.
Syllabus
Ray at Scale: Apple's Approach to Elastic GPU Management | Ray Summit 2024
Taught by
Anyscale
Related Courses
Financial Sustainability: The Numbers side of Social Enterprise+Acumen via NovoEd Cloud Computing Concepts: Part 2
University of Illinois at Urbana-Champaign via Coursera Developing Repeatable ModelsĀ® to Scale Your Impact
+Acumen via Independent Managing Microsoft Windows Server Active Directory Domain Services
Microsoft via edX Introduction aux conteneurs
Microsoft Virtual Academy via OpenClassrooms