Scaling Thanos and Prometheus for Massive Metrics Deployment at Reddit
Offered By: CNCF [Cloud Native Computing Foundation] via YouTube
Course Description
Overview
Explore how Reddit scales its monitoring infrastructure using Thanos and Prometheus in this informative conference talk. Discover the custom monitoring operator developed by Reddit to manage thousands of Prometheus instances, handling over 45 million samples per second and 600 million active series. Learn about the Kubernetes controller used to orchestrate this massive deployment and how Thanos enables long-term storage and global querying capabilities. Gain insights into the tools developed by Reddit's team, the challenges they faced, and the solutions implemented to achieve a robust and scalable metrics system for one of the world's largest social media platforms.
Syllabus
Scaling Thanos at Reddit - Ben Kochie & Trevor Riles, Reddit
Taught by
CNCF [Cloud Native Computing Foundation]
Related Courses
Введение в теорию кибернетических системSaint Petersburg State University via Coursera Dynamical System and Control
Indian Institute of Technology Roorkee via Swayam Kyma – A Flexible Way to Connect and Extend Applications
SAP Learning Linear Systems Theory
Indian Institute of Technology Madras via Swayam Introduction to DevOps and Site Reliability Engineering
Linux Foundation via edX