Observability in the MLOps Lifecycle with Prometheus

Offered By: USENIX via YouTube

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!

Explore a 24-minute conference talk from SREcon23 Asia/Pacific that delves into the crucial role of observability in the MLOps lifecycle using Prometheus. Begin with a gentle introduction to monitoring ML deployments, covering edge cases in production, data drift, concept drift, model metrics, and standard system and resource metrics. Gain an overview of observability and monitoring in the context of MLOps, understanding how monitoring can inform decisions about model retraining, data collection, and more. Learn how to leverage Prometheus for monitoring and performing essential tasks in MLOps, including methods to enhance existing deployments with powerful monitoring capabilities. Witness demonstrations of Prometheus integration with Flyte, Seldon Core, or FastAPI ML deployments, providing practical insights into implementing observability in real-world scenarios.

Syllabus

SREcon23 Asia/Pacific - Observability in the MLOps Lifecycle with Prometheus

Taught by

USENIX

Related Courses

Building Robust ML Production Systems Using OSS Tools for Continuous Delivery for ML
Linux Foundation via YouTube Self-serve Feature Engineering Platform Using Flyte and Feast
Linux Foundation via YouTube Enforcing Data Quality in Data Processing and ML Pipelines with Flyte and Pandera
Linux Foundation via YouTube Efficient Data Parallel Distributed Training with Flyte, Spark and Horovod
Linux Foundation via YouTube Flyte - Cloud Native Machine Learning and Data Processing Platform
Linux Foundation via YouTube