YoVDO

Scaling Thanos and Prometheus for Massive Metrics Deployment at Reddit

Offered By: CNCF [Cloud Native Computing Foundation] via YouTube

Tags

DevOps Courses Kubernetes Courses Prometheus Courses Scalability Courses Cloud Infrastructure Courses Observability Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore how Reddit scales its monitoring infrastructure using Thanos and Prometheus in this informative conference talk. Discover the custom monitoring operator developed by Reddit to manage thousands of Prometheus instances, handling over 45 million samples per second and 600 million active series. Learn about the Kubernetes controller used to orchestrate this massive deployment and how Thanos enables long-term storage and global querying capabilities. Gain insights into the tools developed by Reddit's team, the challenges they faced, and the solutions implemented to achieve a robust and scalable metrics system for one of the world's largest social media platforms.

Syllabus

Scaling Thanos at Reddit - Ben Kochie & Trevor Riles, Reddit


Taught by

CNCF [Cloud Native Computing Foundation]

Related Courses

Kubernetes Hands-On - Deploy Microservices to the AWS Cloud
Udemy
Learn DevOps: Advanced Kubernetes Usage
Udemy
Monitoring & Telemetry for Production Systems
Coursera Project Network via Coursera
Kubernetes: Cloud Native Ecosystem
LinkedIn Learning
Kubernetes: Monitoring with Prometheus
LinkedIn Learning