YoVDO

Ray at Scale - Apple's Approach to Elastic GPU Management

Offered By: Anyscale via YouTube

Tags

Scalability Courses
Multi-Tenancy Courses

Course Description

Overview

Explore Apple's approach to elastic GPU management for scaling AI/ML workloads in this 31-minute conference talk from Ray Summit 2024. Weiwei Yang and Abin Shahab describe how they tackle common challenges such as GPU fragmentation, low utilization, and compromised SLAs by building a multi-tenancy-ready platform based on Ray. Learn about their queuing and GPU quota management system, powered by Apache YuniKorn, and about techniques for achieving resource fairness, GPU preemption, and gang scheduling across diverse Ray workloads in large-scale environments.

Syllabus

Ray at Scale: Apple's Approach to Elastic GPU Management | Ray Summit 2024


Taught by

Anyscale

Related Courses

Cisco SD-WAN (Viptela) with Lab Access
Udemy
Architect SaaS Applications - Unique Challenges & Solutions
Udemy
Provision IoT devices at scale by using Azure IoT Hub Device Provisioning Service (DPS)
Microsoft via Microsoft Learn
Multi-Tenancy and Isolation Using Virtual Clusters in Kubernetes - Mirantis Labs Tech Talks
Mirantis via YouTube
Secure Multi-Cluster & Multi-Tenant Cloud Native Apps with Mirantis & Tetrate
Mirantis via YouTube