Unlocking New Pose in HPC - Containerization, Cloud, and GPU-based Workloads
Offered By: CNCF [Cloud Native Computing Foundation] via YouTube
Course Description
Overview
Explore recent advances in High-Performance Computing (HPC) in this conference talk, which covers the integration of containerization, cloud technologies, and GPU-based workloads in HPC environments. Learn how Kubernetes enables unified management of heterogeneous compute, network, and storage resources. Discover techniques for GPU virtualization that create shared resource pools for fine-grained quota management and multi-tenant sharing. Understand how custom Kubernetes schedulers are implemented to prioritize GPU tasks. Gain insights into visual GPU monitoring with Prometheus, which offers both aggregated performance metrics and fine-grained monitoring. Explore the creation of a one-stop workbench for scientists that supports end-to-end algorithm development, model training, and AI service deployment in the cloud, with mainstream artificial intelligence frameworks scheduled by Kubernetes.
Syllabus
Unlocking New Pose in HPC - Containerization, Cloud, and GPU-based Workloads - Ying Xu & Xianglong Zeng
Taught by
CNCF [Cloud Native Computing Foundation]
Related Courses
Fundamentals of Containers, Kubernetes, and Red Hat OpenShift (Red Hat via edX)
Configuration Management for Containerized Delivery (Microsoft via edX)
Getting Started with Google Kubernetes Engine - Español (Google Cloud via Coursera)
Getting Started with Google Kubernetes Engine - 日本語版 (Google Cloud via Coursera)
Architecting with Google Kubernetes Engine: Foundations en Español (Google Cloud via Coursera)