Scaling Batch, Big Data, and AI Workloads Beyond the Kubernetes Scheduler
Offered By: CNCF [Cloud Native Computing Foundation] via YouTube
Course Description
Overview
Explore the challenges and solutions for scaling batch, big data, and AI workloads on Kubernetes in this informative conference talk. Delve into the limitations of the traditional Kubernetes scheduler when handling resource-intensive, heterogeneous processes. Discover recent innovations in the Kubernetes ecosystem designed to address issues such as resource fragmentation, lack of all-or-nothing semantics, low throughput, and limited priority, quota, and preemption management. Compare and contrast projects like Koordinator, Kueue, MCAD, Volcano, and YuniKorn, examining their design choices and trade-offs. Gain valuable insights to help determine the most suitable solution for optimizing Kubernetes cluster utilization for batch workloads. Learn from Red Hat experts Antonin Stefanutti and Anish Asthana as they provide a comprehensive overview of the evolving landscape of Kubernetes scheduling for demanding workloads.
Syllabus
Scale Your Batch / Big Data / AI Workloads Beyond the Kubernetes Scheduler
Taught by
CNCF [Cloud Native Computing Foundation]
Related Courses
SIG Scheduling Deep Dive in Kubernetes - Latest Enhancements and OpportunitiesCNCF [Cloud Native Computing Foundation] via YouTube Kubernetes WG Batch: Recent Improvements and Future Roadmap
CNCF [Cloud Native Computing Foundation] via YouTube Building a Batch System for the Cloud with Kueue
CNCF [Cloud Native Computing Foundation] via YouTube Kueue: Kubernetes-Native Job Queueing for Batch Workloads
CNCF [Cloud Native Computing Foundation] via YouTube Sailing Ray Workloads with KubeRay and Kueue in Kubernetes
CNCF [Cloud Native Computing Foundation] via YouTube