Distributed Training with Ray on Kubernetes - Lyft's ML Platform
Offered By: Anyscale via YouTube
Course Description
Overview
This conference talk explains how Lyft uses Ray on Kubernetes for distributed training. Lyft's ML platform infrastructure is built entirely on Kubernetes, which lets it scale and bootstrap resources quickly. The talk covers the custom SDKs that let users spawn on-demand Ray clusters for model training directly from notebooks. These SDKs hide the complexity of cluster management, so users can focus on their core tasks while the platform handles the technical details. Together, Ray and Kubernetes form the basis of Lyft's infrastructure for distributed training.
Syllabus
Distributed training with Ray on Kubernetes at Lyft
Taught by
Anyscale
Related Courses
Introduction to Cloud Infrastructure Technologies
Linux Foundation via edX
Scalable Microservices with Kubernetes
Google via Udacity
Google Cloud Fundamentals: Core Infrastructure
Google via Coursera
Introduction to Kubernetes
Linux Foundation via edX
Fundamentals of Containers, Kubernetes, and Red Hat OpenShift
Red Hat via edX