YoVDO

Serving Large Language Models with KubeRay on TPUs

Offered By: Anyscale via YouTube

Tags

Kubernetes Courses Scalability Courses TPUs Courses Distributed Machine Learning Courses KubeRay Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Discover how to serve large language models using KubeRay on TPUs in this 25-minute talk from Anyscale. Learn about the technical challenges of serving models with hundreds of billions of parameters and explore how integrating KubeRay with TPUs creates a powerful platform for efficient LLM deployment. Gain insights into the benefits of this approach, including increased performance, improved scalability, reduced costs, enhanced flexibility, and better monitoring capabilities. Understand how KubeRay simplifies Ray cluster management on cloud platforms, while TPUs provide specialized processing power for neural network workloads. Access the accompanying slide deck for visual references and dive deeper into the world of distributed machine learning with Ray, the popular open-source framework for scaling AI workloads.

Syllabus

Serving Large Language Models with KubeRay on TPUs


Taught by

Anyscale

Related Courses

Scalable Data Science
Indian Institute of Technology, Kharagpur via Swayam
Data Science and Engineering with Spark
Berkeley University of California via edX
Data Science on Google Cloud: Machine Learning
Google via Qwiklabs
Modern Distributed Systems
Delft University of Technology via edX
KungFu - Making Training in Distributed Machine Learning Adaptive
USENIX via YouTube