YoVDO

Scaling Inference Deployments with NVIDIA Triton Inference Server and Ray Serve

Offered By: Anyscale via YouTube

Tags

Ray Serve Courses Machine Learning Courses Stable Diffusion Courses Scaling Courses Anyscale Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the collaboration between Ray Serve and NVIDIA Triton Inference Server in this conference talk from Ray Summit 2024. Learn about the new Python API for Triton Inference Server and its seamless integration with Ray Serve applications. Discover how this partnership enhances capabilities for scaling inference deployments, combining the strengths of both open-source platforms. Gain insights into improving ML model performance through a stable diffusion demo and understand the benefits of utilizing Triton's advanced optimization tools like Performance and Model Analyzer. See how to fine-tune model configurations based on specific throughput and latency requirements, empowering you to optimize your inference deployments effectively.

Syllabus

Scaling Inference Deployments with NVIDIA Triton Inference Server and Ray Serve | Ray Summit 2024


Taught by

Anyscale

Related Courses

The New AI Model Licenses Have a Legal Loophole - OpenRAIL-M of BLOOM, Stable Diffusion, etc.
Yannic Kilcher via YouTube
Stable Diffusion - Master AI Art: Installation, Prompts, Txt2img-Img2img, Out-Inpaint and Resize Tutorial
ChamferZone via YouTube
Get Started With Stable Diffusion - Code, HF Spaces, Diffusers Notebooks
Aleksa Gordić - The AI Epiphany via YouTube
Stable Diffusion Animation Tutorial - Deforum All Settings Explained - Make Your Own AI Video
Sebastian Kamph via YouTube
Stable Diffusion - What, Why, How?
Edan Meyer via YouTube