YoVDO

The State of vLLM - Advancements in LLM Inference and Serving

Offered By: Anyscale via YouTube

Tags

vLLM Courses Machine Learning Courses Inference Courses Distributed Computing Courses Open Source Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the latest developments in vLLM, the open-source LLM inference and serving engine, in this 35-minute conference talk from Ray Summit 2024. Join Kuntai Du from the University of Chicago and Zhuohan Li from UC Berkeley as they delve into the significant progress made by vLLM over the past year. Discover the project's growing adoption, new features, and performance improvements. Gain insights into the community growth and governance changes shaping vLLM's ecosystem. Learn about the roadmap for upcoming releases and get a glimpse into the future of this rapidly evolving LLM serving solution. Ideal for those interested in efficient LLM deployment and serving technologies, this presentation offers valuable information on the cutting-edge advancements in the field.

Syllabus

The State of vLLM | Ray Summit 2024


Taught by

Anyscale

Related Courses

Finetuning, Serving, and Evaluating Large Language Models in the Wild
Open Data Science via YouTube
Cloud Native Sustainable LLM Inference in Action
CNCF [Cloud Native Computing Foundation] via YouTube
Optimizing Kubernetes Cluster Scaling for Advanced Generative Models
Linux Foundation via YouTube
LLaMa for Developers
LinkedIn Learning
Scaling Video Ad Classification Across Millions of Classes with GenAI
Databricks via YouTube