Scaling Thanos and Prometheus for Massive Metrics Deployment at Reddit
Offered By: CNCF [Cloud Native Computing Foundation] via YouTube
Course Description
Overview
In this conference talk, explore how Reddit scales its monitoring infrastructure using Thanos and Prometheus. Discover the custom monitoring operator Reddit developed to manage thousands of Prometheus instances, which together handle over 45 million samples per second and 600 million active series. Learn about the Kubernetes controller used to orchestrate this massive deployment and how Thanos enables long-term storage and global querying. Gain insights into the tools Reddit's team built, the challenges they faced, and the solutions they implemented to achieve a robust and scalable metrics system for one of the world's largest social media platforms.
Syllabus
Scaling Thanos at Reddit - Ben Kochie & Trevor Riles, Reddit
Taught by
CNCF [Cloud Native Computing Foundation]
Related Courses
Kubernetes Hands-On - Deploy Microservices to the AWS Cloud (Udemy)
Learn DevOps: Advanced Kubernetes Usage (Udemy)
Monitoring & Telemetry for Production Systems (Coursera Project Network via Coursera)
Kubernetes: Cloud Native Ecosystem (LinkedIn Learning)
Kubernetes: Monitoring with Prometheus (LinkedIn Learning)