Batch Systems in Production with Kueue - Multi-Tenancy and Fungibility
Offered By: CNCF [Cloud Native Computing Foundation] via YouTube
Course Description
Overview
Explore the capabilities of Kueue, a cloud-native job scheduler for building multi-tenant batch systems on Kubernetes clusters, in this informative conference talk. Learn about Kueue's architecture, extensibility, and its ability to support various workloads while implementing job queueing based on quotas, priority, and resource sharing hierarchies. Discover how Kueue operates in both on-premises and autoscaled cloud environments, maximizing resource utilization through borrowing and preemption mechanisms. Gain insights into Kueue's real-world application in production self-managed clusters, serving machine-learning researchers, MLOps engineers, and data scientists. Understand how Kueue integrates with popular frameworks like DeepSpeed, PyTorch, Kubernetes Job, RayJob, and Jupyter to provide fair resource use and efficient management of accelerators and other resources.
Syllabus
Batch Systems in Production with Kueue: Multi-Tenancy and Fungibility- Yuki Iwai & Aldo Culquicondor
Taught by
CNCF [Cloud Native Computing Foundation]
Related Courses
Cisco SD-WAN (Viptela) with Lab AccessUdemy Architect SaaS Applications - Unique Challenges & Solutions
Udemy Provision IoT devices at scale by using Azure IoT Hub Device Provisioning Service (DPS)
Microsoft via Microsoft Learn Multi-Tenancy and Isolation Using Virtual Clusters in Kubernetes - Mirantis Labs Tech Talks
Mirantis via YouTube Secure Multi-Cluster & Multi-Tenant Cloud Native Apps with Mirantis & Tetrate
Mirantis via YouTube