When Prometheus Can't Take the Load Anymore
Offered By: CNCF [Cloud Native Computing Foundation] via YouTube
Course Description
Overview
Explore a comprehensive analysis of scaling Prometheus for high-load environments in this conference talk. Discover the journey of Riskified's SRE team as they encountered limitations with their initial Prometheus setup and embarked on a quest to find the optimal solution for multi-cluster, high-availability, and long-term metrics storage. Compare the features, advantages, and disadvantages of Thanos, Cortex, and M3 as potential alternatives. Gain valuable insights into the performance, cost-effectiveness, and operational aspects of each tool, enabling you to make informed decisions for your own use case. Learn about the Cortex architecture, its advantages, and potential drawbacks. By the end of this presentation, acquire a deeper understanding of advanced monitoring solutions and their applicability in cloud-native environments.
Syllabus
Introduction
The Problem
The Right Path
Cortex
Cortex Advantages
Atmos Architecture
Atmos Disadvantages
Summary
Other Aspects
Conclusion
Taught by
CNCF [Cloud Native Computing Foundation]
Related Courses
Building Geospatial Apps on Postgres, PostGIS, & Citus at Large ScaleMicrosoft via YouTube Unlocking the Power of ML for Your JavaScript Applications with TensorFlow.js
TensorFlow via YouTube Managing the Reactive World with RxJava - Jake Wharton
ChariotSolutions via YouTube What's New in Grails 2.0
ChariotSolutions via YouTube Performance Analysis of Apache Spark and Presto in Cloud Environments
Databricks via YouTube