Observability in the MLOps Lifecycle with Prometheus
Offered By: USENIX via YouTube
Course Description
Overview
Explore a 24-minute conference talk from SREcon23 Asia/Pacific that delves into the crucial role of observability in the MLOps lifecycle using Prometheus. Begin with a gentle introduction to monitoring ML deployments, covering edge cases in production, data drift, concept drift, model metrics, and standard system and resource metrics. Gain an overview of observability and monitoring in the context of MLOps, understanding how monitoring can inform decisions about model retraining, data collection, and more. Learn how to leverage Prometheus for monitoring and performing essential tasks in MLOps, including methods to enhance existing deployments with powerful monitoring capabilities. Witness demonstrations of Prometheus integration with Flyte, Seldon Core, or FastAPI ML deployments, providing practical insights into implementing observability in real-world scenarios.
Syllabus
SREcon23 Asia/Pacific - Observability in the MLOps Lifecycle with Prometheus
Taught by
USENIX
Related Courses
Building Robust ML Production Systems Using OSS Tools for Continuous Delivery for MLLinux Foundation via YouTube Self-serve Feature Engineering Platform Using Flyte and Feast
Linux Foundation via YouTube Enforcing Data Quality in Data Processing and ML Pipelines with Flyte and Pandera
Linux Foundation via YouTube Efficient Data Parallel Distributed Training with Flyte, Spark and Horovod
Linux Foundation via YouTube Flyte - Cloud Native Machine Learning and Data Processing Platform
Linux Foundation via YouTube