YoVDO

Building Serverless AI Apps with Spin and WebAssembly

Offered By: CNCF [Cloud Native Computing Foundation] via YouTube

Tags

WebAssembly Courses Kubernetes Courses Serverless Computing Courses Docker Desktop Courses LLaMA2 Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the world of serverless AI applications in this conference talk by Matt Butcher and Radu Matei from Fermyon. Discover how Spin, an open-source tool, enables the creation of scalable serverless WebAssembly apps. Learn about WebAssembly's platform neutrality and its ability to run on various OSes, CPU architectures, and GPUs. Follow along as the speakers build a simple AI inferencing app using the LLaMa2 Chat LLM, demonstrating local testing and deployment across different environments, including Docker Desktop and Kubernetes clusters with Wasm support. Gain insights into the performance characteristics of each environment and delve into the nuances of GPU scheduling in clustered environments. Understand how Spin's fine-grained GPU scheduling can enhance GPU utilization across multiple applications, providing valuable knowledge for developers interested in efficient AI app deployment and optimization.

Syllabus

Building Serverless AI Apps with Spin and WebAssembly - Matt Butcher & Radu Matei, Fermyon


Taught by

CNCF [Cloud Native Computing Foundation]

Related Courses

LLaMA2 for Multilingual Fine Tuning
Sam Witteveen via YouTube
Set Up a Llama2 Endpoint for Your LLM App in OctoAI
Docker via YouTube
AI Engineer Skills for Beginners: Code Generation Techniques
All About AI via YouTube
Training and Evaluating LLaMA2 Models with Argo Workflows and Hera
CNCF [Cloud Native Computing Foundation] via YouTube
LangChain Crash Course - 6 End-to-End LLM Projects with OpenAI, LLAMA2, and Gemini Pro
Krish Naik via YouTube