Bytedance Spark Support for Wanka Model Inference - GPU Optimization Practices
Offered By: The ASF via YouTube
Course Description
Overview
Explore how Bytedance's infrastructure team enhanced Spark to support large-scale GPU-based model inference on Kubernetes. Learn about the challenges faced in migrating from Hadoop to Kubernetes, including shortages of GPU compute supply, resource-pool scaling limits, and wasted online resources. Discover the solutions implemented through GPU sharing, mixed GPU scheduling, Spark engine improvements, and platform enhancements. Gain insight into how these advancements enabled inference over 8 billion multi-modal training data points on 7,000 mixed GPUs in just 7.5 hours, significantly improving resource efficiency and stability for Wanka model inference practices.
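For context on the GPU-scheduling topics the talk covers, the sketch below shows how Spark 3.x's built-in GPU resource scheduling is typically configured for a Kubernetes deployment. This is generic upstream Spark configuration, not Bytedance's internal setup; the API server address, container image, executor count, and discovery-script path are placeholders.

```shell
# Illustrative Spark 3.x GPU scheduling on Kubernetes (generic, not Bytedance's setup).
# Placeholders: API server URL, container image, and discovery-script path.
spark-submit \
  --master k8s://https://<k8s-apiserver>:6443 \
  --deploy-mode cluster \
  --conf spark.kubernetes.container.image=<spark-gpu-image> \
  --conf spark.executor.instances=4 \
  --conf spark.executor.resource.gpu.amount=1 \
  --conf spark.task.resource.gpu.amount=1 \
  --conf spark.executor.resource.gpu.vendor=nvidia.com \
  --conf spark.executor.resource.gpu.discoveryScript=/opt/spark/getGpusResources.sh \
  local:///opt/spark/app/inference_job.py
```

The `discoveryScript` reports which GPU addresses each executor owns, and the per-task GPU amount lets Spark pack or isolate inference tasks per device; fractional values (e.g. `0.5`) allow task-level GPU sharing within an executor.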
Syllabus
Bytedance Spark Supports Wanka Model Inference Practices
Taught by
The ASF
Related Courses
Introduction to Cloud Infrastructure Technologies — Linux Foundation via edX
Scalable Microservices with Kubernetes — Google via Udacity
Google Cloud Fundamentals: Core Infrastructure — Google via Coursera
Introduction to Kubernetes — Linux Foundation via edX
Fundamentals of Containers, Kubernetes, and Red Hat OpenShift — Red Hat via edX