Building Serverless AI Apps with Spin and WebAssembly
Offered By: CNCF [Cloud Native Computing Foundation] via YouTube
Course Description
Overview
Explore the world of serverless AI applications in this conference talk by Matt Butcher and Radu Matei from Fermyon. Discover how Spin, an open-source tool, enables the creation of scalable serverless WebAssembly apps. Learn about WebAssembly's platform neutrality and its ability to run on various OSes, CPU architectures, and GPUs. Follow along as the speakers build a simple AI inferencing app using the LLaMa2 Chat LLM, demonstrating local testing and deployment across different environments, including Docker Desktop and Kubernetes clusters with Wasm support. Gain insights into the performance characteristics of each environment and delve into the nuances of GPU scheduling in clustered environments. Understand how Spin's fine-grained GPU scheduling can enhance GPU utilization across multiple applications, providing valuable knowledge for developers interested in efficient AI app deployment and optimization.
Syllabus
Building Serverless AI Apps with Spin and WebAssembly - Matt Butcher & Radu Matei, Fermyon
Taught by
CNCF [Cloud Native Computing Foundation]
Related Courses
Introduction to Cloud Infrastructure TechnologiesLinux Foundation via edX Scalable Microservices with Kubernetes
Google via Udacity Google Cloud Fundamentals: Core Infrastructure
Google via Coursera Introduction to Kubernetes
Linux Foundation via edX Fundamentals of Containers, Kubernetes, and Red Hat OpenShift
Red Hat via edX