YoVDO

SLA-Aware Machine Learning Inference Serving on Serverless Computing Platforms

Offered By: MLOps World: Machine Learning in Production via YouTube

Tags

Machine Learning Courses Cloud Computing Courses Knative Courses MLOps Courses Serverless Computing Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a conference talk on SLA-aware machine learning inference serving on serverless computing platforms. Delve into the challenges of serving machine learning inference workloads in production environments and the complexities of meeting SLA requirements while optimizing infrastructure costs. Learn about MLProxy, an adaptive reverse proxy designed to support efficient machine learning serving workloads on serverless systems. Discover how MLProxy utilizes adaptive batching to ensure SLA compliance and optimize serverless costs. Examine the results of rigorous experiments conducted on Knative, demonstrating MLProxy's ability to significantly reduce serverless deployment costs and SLA violations across various model serving frameworks.

Syllabus

SLA Aware Machine Learning Inference Serving on Serverless Computing Platforms


Taught by

MLOps World: Machine Learning in Production

Related Courses

Software as a Service
University of California, Berkeley via Coursera
Software Defined Networking
Georgia Institute of Technology via Coursera
Pattern-Oriented Software Architectures: Programming Mobile Services for Android Handheld Systems
Vanderbilt University via Coursera
Web-Technologien
openHPI
Données et services numériques, dans le nuage et ailleurs
Certificat informatique et internet via France Université Numerique