YoVDO

Kueue: Kubernetes-Native Job Queueing for Batch Workloads

Offered By: CNCF [Cloud Native Computing Foundation] via YouTube

Tags

Kubernetes Courses
GPU Computing Courses
Cluster Management Courses
Capacity Management Courses
Autoscaling Courses
Kueue Courses

Course Description

Overview

Explore Kueue, a Kubernetes-native job queueing system, in this conference talk. Learn why batch workloads are hard to manage in Kubernetes and how Kueue addresses the limitations of pod-centric scheduling. Discover the concept of job queueing and its benefits for resource management, especially for GPU-intensive tasks. Understand Kueue's design principles, API structure, and integration with existing Kubernetes components. Gain insights into resource allocation across multiple namespaces and cluster queues, borrowing cohorts, and autoscaling capabilities. Learn about the workload API, custom workload handling, and capacity management. See how Kueue interacts with Indexed Jobs and improves cluster efficiency for batch processing.

Syllabus

Introduction
Problem description
What is a job
Batch workloads
Flexible on multiple dimensions
The problem
Microbatch jobs
What is Job Queueing
What we want to solve
Why propose a new controller
The concept of a queue
Design principle
Job API
Resource Model
Multiple namespaces
Multiple cluster queues
Borrowing cohorts
Queue API
Cluster Queue API
The quota
Borrowing Cohort
How it works
Suspend
Queueing controller
Existing controller
Handling custom workloads
Workload API
Workload controller
Autoscaling
Why
Integration
Capacity requests
Pod management
Release
Questions
How does the queue interact with Indexed Jobs


Taught by

CNCF [Cloud Native Computing Foundation]

Related Courses

Моделирование биологических молекул на GPU (Biomolecular modeling on GPU)
Moscow Institute of Physics and Technology via Coursera
Practical Deep Learning For Coders
fast.ai via Independent
GPU Architectures And Programming
Indian Institute of Technology, Kharagpur via Swayam
Perform Real-Time Object Detection with YOLOv3
Coursera Project Network via Coursera
Getting Started with PyTorch
Coursera Project Network via Coursera