YoVDO

Observability in the MLOps Lifecycle with Prometheus

Offered By: USENIX via YouTube

Tags

SREcon Courses Prometheus Courses MLOps Courses FastAPI Courses Observability Courses Data Drift Courses Flyte Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a 24-minute conference talk from SREcon23 Asia/Pacific that delves into the crucial role of observability in the MLOps lifecycle using Prometheus. Begin with a gentle introduction to monitoring ML deployments, covering edge cases in production, data drift, concept drift, model metrics, and standard system and resource metrics. Gain an overview of observability and monitoring in the context of MLOps, understanding how monitoring can inform decisions about model retraining, data collection, and more. Learn how to leverage Prometheus for monitoring and performing essential tasks in MLOps, including methods to enhance existing deployments with powerful monitoring capabilities. Witness demonstrations of Prometheus integration with Flyte, Seldon Core, or FastAPI ML deployments, providing practical insights into implementing observability in real-world scenarios.

Syllabus

SREcon23 Asia/Pacific - Observability in the MLOps Lifecycle with Prometheus


Taught by

USENIX

Related Courses

How to Detect Silent Failures in ML Models
Data Science Dojo via YouTube
Dataset Management for Computer Vision - Important Component to Delivering Computer Vision Solutions
Open Data Science via YouTube
Testing ML Models in Production - Detecting Data and Concept Drift
Databricks via YouTube
Ekya - Continuous Learning of Video Analytics Models on Edge Compute Servers
USENIX via YouTube
Building and Maintaining High-Performance AI
Data Science Dojo via YouTube