Elevating DoorDash Model Training with Ray: From Good to Great
Offered By: Anyscale via YouTube
Course Description
Overview
Explore a 31-minute conference talk detailing DoorDash's journey to enhance machine learning model training using Ray. Discover how the company addressed challenges in scalability, costs, development velocity, and observability within their forecasting and training pipelines. Learn about the proof-of-concept implementation, benchmarking setup, and integration of Ray into the existing ML Platform architecture. Gain insights into working with open-source KubeRay and the future vision for DoorDash's ML Platform with Ray as a core component. Access the accompanying slide deck for a comprehensive overview of the presentation, which covers topics such as Project Lucent, Lucent Workflows, ARGILLE, and GPU stability.
Syllabus
Intro
Why Train ML Models
Ray Summit 2020
Project Lucent
Lucent Workflows
ARGIL
LEMS
GPU Stability
Closing Thoughts
Questions
Taught by
Anyscale
Related Courses
Sailing Ray Workloads with KubeRay and Kueue in KubernetesCNCF [Cloud Native Computing Foundation] via YouTube Accelerate Your GenAI Model Inference with Ray and Kubernetes
CNCF [Cloud Native Computing Foundation] via YouTube Forecasting Covid Infections for the UK's NHS Using Ray and Kubernetes
Anyscale via YouTube KubeRay: A Ray Cluster Management Solution on Kubernetes
Anyscale via YouTube The Different Shades of Using KubeRay with Kubernetes
Anyscale via YouTube