YoVDO

Observability in the MLOps Lifecycle with Prometheus

Offered By: USENIX via YouTube

Tags

SREcon Courses Prometheus Courses MLOps Courses FastAPI Courses Observability Courses Data Drift Courses Flyte Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a 24-minute conference talk from SREcon23 Asia/Pacific that delves into the crucial role of observability in the MLOps lifecycle using Prometheus. Begin with a gentle introduction to monitoring ML deployments, covering edge cases in production, data drift, concept drift, model metrics, and standard system and resource metrics. Gain an overview of observability and monitoring in the context of MLOps, understanding how monitoring can inform decisions about model retraining, data collection, and more. Learn how to leverage Prometheus for monitoring and performing essential tasks in MLOps, including methods to enhance existing deployments with powerful monitoring capabilities. Witness demonstrations of Prometheus integration with Flyte, Seldon Core, or FastAPI ML deployments, providing practical insights into implementing observability in real-world scenarios.

Syllabus

SREcon23 Asia/Pacific - Observability in the MLOps Lifecycle with Prometheus


Taught by

USENIX

Related Courses

How to Not Destroy Your Production Kubernetes Clusters
USENIX via YouTube
SRE and ML - Why It Matters
USENIX via YouTube
Knowledge and Power - A Sociotechnical Systems Discussion on the Future of SRE
USENIX via YouTube
Tracing Bare Metal with OpenTelemetry
USENIX via YouTube
Improving How We Observe Our Observability Data - Techniques for SREs
USENIX via YouTube