Building Large-Scale Data Processing Pipelines for Multimodal Models with Ray - ByteDance Case Study
Offered By: Anyscale via YouTube
Course Description
Overview
Explore ByteDance's innovative approach to building large-scale data processing pipelines for multimodal models using Ray in this 35-minute conference talk. Discover how Xiaohong Dong, Wanxing Wang, and Liguang Xie from ByteDance tackled the challenges of processing vast amounts of high-quality video data for advanced video generation models. Learn about their utilization of Ray's ecosystem, including Ray Core, Ray Data, and Ray Serve, to create a robust and scalable data pipeline. Gain valuable insights into managing Ray infrastructure, best practices for large-scale multimodal AI projects, and solutions for dynamic scaling and orchestration of heterogeneous resources. Uncover a blueprint for leveraging Ray in ambitious AI endeavors and understand how ByteDance overcame the complexities of handling massive video datasets.
Syllabus
How Bytedance Builds Large-Scale Data Processing Pipelines for Multimodal Models with Ray | RS 24
Taught by
Anyscale
Related Courses
Patterns of ML Models in ProductionPyCon US via YouTube Deploying Many Models Efficiently with Ray Serve
Anyscale via YouTube Modernizing DoorDash Model Serving Platform with Ray Serve
Anyscale via YouTube Ray for Large-Scale Time-Series Energy Forecasting to Plan a More Resilient Power Grid
Anyscale via YouTube Enabling Cost-Efficient LLM Serving with Ray Serve
Anyscale via YouTube