YoVDO

Deploying Many Models Efficiently with Ray Serve

Offered By: Anyscale via YouTube

Tags

Machine Learning Courses Distributed Systems Courses Model Deployment Courses Ray Serve Courses Anyscale Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore efficient deployment and management of multiple models using Ray Serve in this 26-minute conference talk. Gain comprehensive insights into serving numerous models while optimizing resource utilization and maintaining ease of use. Learn about three key features of Ray Serve: model composition, multi-application, and model multiplexing. Discover common industry patterns for serving many models and how Ray Serve simplifies management and enhances performance. Dive into case studies of Ray Serve users running many-model applications in production. Access the slide deck for additional information and visual aids. Understand how Ray, an open-source framework, powers ambitious AI workloads, including Generative AI, LLMs, and computer vision. Consider Anyscale's managed Ray service for developing, running, and scaling AI applications.

Syllabus

Deploying Many Models Efficiently with Ray Serve


Taught by

Anyscale

Related Courses

Advanced Operating Systems
Georgia Institute of Technology via Udacity
High Performance Computing
Georgia Institute of Technology via Udacity
GT - Refresher - Advanced OS
Georgia Institute of Technology via Udacity
Distributed Machine Learning with Apache Spark
University of California, Berkeley via edX
CS125x: Advanced Distributed Machine Learning with Apache Spark
University of California, Berkeley via edX