YoVDO

Sailing Ray Workloads with KubeRay and Kueue in Kubernetes

Offered By: CNCF [Cloud Native Computing Foundation] via YouTube

Tags

Kubernetes Courses Machine Learning Courses Cloud Computing Courses Distributed Computing Courses Kueue Courses KubeRay Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore how to manage Ray workloads in Kubernetes using KubeRay and Kueue in this informative conference talk. Learn about the growing compute demands in machine learning and how Ray, a unified computing framework, enables ML engineers to scale workloads without complex infrastructure. Discover the benefits of using Kubernetes with KubeRay for managing diverse workloads, and gain insights from ByteDance's experience of submitting thousands of jobs daily to Ray clusters. Understand the challenges of managing concurrent Ray jobs, including job starvation and resource allocation, and how Kueue, a Kubernetes native job queueing system, addresses these issues with features like resource management, multi-tenant support, and fair-sharing.

Syllabus

Sailing Ray Workloads with KubeRay and Kueue in Kubernetes - Jason Hu, Volcano Engine & Kante Yin


Taught by

CNCF [Cloud Native Computing Foundation]

Related Courses

SIG Scheduling Deep Dive in Kubernetes - Latest Enhancements and Opportunities
CNCF [Cloud Native Computing Foundation] via YouTube
Kubernetes WG Batch: Recent Improvements and Future Roadmap
CNCF [Cloud Native Computing Foundation] via YouTube
Building a Batch System for the Cloud with Kueue
CNCF [Cloud Native Computing Foundation] via YouTube
Kueue: Kubernetes-Native Job Queueing for Batch Workloads
CNCF [Cloud Native Computing Foundation] via YouTube
Batch Systems in Production with Kueue - Multi-Tenancy and Fungibility
CNCF [Cloud Native Computing Foundation] via YouTube