YoVDO

Strategies for Efficient LLM Deployments in Any Cluster

Offered By: CNCF [Cloud Native Computing Foundation] via YouTube

Tags

Kubernetes Courses Cloud Computing Courses WebAssembly Courses Model Selection Courses Edge Computing Courses Model Optimization Courses Cluster Management Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore strategies for efficient Large Language Model (LLM) deployments in any cluster through this informative conference talk. Discover how to overcome challenges posed by LLMs' substantial size, resource demands, and management complexity in Kubernetes environments. Learn techniques to reduce model footprint, enabling deployment from cloud to edge. Gain insights on selecting the right model, reducing size, and optimizing resource utilization through WebAssembly. Understand the balance between resource usage and quality in LLM deployments. Stay updated on emerging technologies, projects, and models in this rapidly evolving ecosystem.

Syllabus

Strategies for Efficient LLM Deployments in Any Cluster -Angel M De Miguel Meana & Francisco Cabrera


Taught by

CNCF [Cloud Native Computing Foundation]

Related Courses

Software as a Service
University of California, Berkeley via Coursera
Software Defined Networking
Georgia Institute of Technology via Coursera
Pattern-Oriented Software Architectures: Programming Mobile Services for Android Handheld Systems
Vanderbilt University via Coursera
Web-Technologien
openHPI
Données et services numériques, dans le nuage et ailleurs
Certificat informatique et internet via France Université Numerique