YoVDO

Building Large-Scale Data Processing Pipelines for Multimodal Models with Ray - ByteDance Case Study

Offered By: Anyscale via YouTube

Tags

Data Processing Courses
Distributed Computing Courses
Ray Serve Courses

Course Description

Overview

Explore how ByteDance builds large-scale data processing pipelines for multimodal models using Ray in this 35-minute conference talk. Xiaohong Dong, Wanxing Wang, and Liguang Xie from ByteDance explain how they tackled the challenges of processing vast amounts of high-quality video data for video generation models. Learn how they used Ray's ecosystem, including Ray Core, Ray Data, and Ray Serve, to build a scalable data pipeline. The talk also covers managing Ray infrastructure, best practices for large-scale multimodal AI projects, and dynamic scaling and orchestration of heterogeneous resources.

Syllabus

How Bytedance Builds Large-Scale Data Processing Pipelines for Multimodal Models with Ray | RS 24


Taught by

Anyscale

Related Courses

Patterns of ML Models in Production
PyCon US via YouTube
Deploying Many Models Efficiently with Ray Serve
Anyscale via YouTube
Modernizing DoorDash Model Serving Platform with Ray Serve
Anyscale via YouTube
Ray for Large-Scale Time-Series Energy Forecasting to Plan a More Resilient Power Grid
Anyscale via YouTube
Enabling Cost-Efficient LLM Serving with Ray Serve
Anyscale via YouTube