YoVDO

Introduction to Model Deployment with Ray Serve

Offered By: MLOps World: Machine Learning in Production via YouTube

Tags

Ray Serve Courses Python Courses Distributed Systems Courses MLOps Courses FastAPI Courses Scalable Architectures Courses Model Deployment Courses MLFlow Courses Anyscale Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the fundamentals of model deployment using Ray Serve in this comprehensive tutorial from MLOps World: Machine Learning in Production. Dive into a two-part session led by Jules Damji and Archit Kulkarni from Anyscale Inc, covering Ray core APIs and Ray Serve concepts. Begin with hands-on coding exercises to master Ray's distributed computing patterns, then progress to Ray Serve's scalable architecture and deployment strategies. Learn to create, expose, and deploy models using core deployment APIs through practical Jupyter notebook examples. Discover Ray Serve's integration with MLflow for model registry management and its compatibility with FastAPI. Gain essential skills in converting Python functions and classes to distributed settings, utilizing Ray Serve APIs for model deployment, accessing deployment endpoints, configuring compute resources for production scaling, and leveraging key integrations to enhance your MLOps workflow.

Syllabus

Introduction to Model Deployment with Ray Serve


Taught by

MLOps World: Machine Learning in Production

Related Courses

Optimizing LLM Inference with AWS Trainium, Ray, vLLM, and Anyscale
Anyscale via YouTube
Scalable and Cost-Efficient AI Workloads with AWS and Anyscale
Anyscale via YouTube
End-to-End LLM Workflows with Anyscale
Anyscale via YouTube
Developing and Serving RAG-Based LLM Applications in Production
Anyscale via YouTube
Deploying Many Models Efficiently with Ray Serve
Anyscale via YouTube