YoVDO

Heterogeneous Training Cluster with Ray at Netflix

Offered By: Anyscale via YouTube

Tags

Machine Learning Courses Deep Learning Courses Distributed Systems Courses GPU Computing Courses Scalability Courses Cluster Management Courses Heterogeneous Computing Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the benefits of using Ray to build a heterogeneous training cluster for deep learning models at Netflix. Learn how to set up a cluster with a mix of CPU and GPU instances, run distributed training jobs, and leverage Ray's automatic resource allocation for scheduling different types of workers. Discover best practices for configuring and managing persistent clusters using Ray, while addressing challenges in building and maintaining such systems. Gain insights into how Netflix's Machine Learning Platform team optimizes infrastructure for various use cases, including recommendations, content understanding, and artwork generation. Understand the importance of reliable, scalable, and robust training and deployment of machine learning models in the entertainment industry.

Syllabus

Heterogeneous Training Cluster with Ray at Netflix


Taught by

Anyscale

Related Courses

Adobe Experience Manager and MongoDB
MongoDB University
Elastic Cloud Infrastructure: Containers and Services auf Deutsch
Google Cloud via Coursera
Architecting with Google Kubernetes Engine: Foundations en Français
Google Cloud via Coursera
Kubernetes Hands-On - Deploy Microservices to the AWS Cloud
Udemy
Docker Swarm: BEGINNER + ADVANCED
Udemy