YoVDO

High Available and Scalable Prometheus with Thanos in Alibaba

Offered By: Linux Foundation via YouTube

Tags

Prometheus Courses Cloud Computing Courses E-commerce Courses Kubernetes Courses Scalability Courses High Availability Courses Alibaba Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore how Alibaba Group leverages Kubernetes, Prometheus, and Thanos to support their massive e-commerce operations in this conference talk. Discover the challenges and solutions for implementing a highly available and scalable fine-grained monitoring system capable of handling 4 million TPS and 10K requests per second. Learn about scaling Prometheus for large-scale scenarios, optimizing query latency across multiple Prometheus instances using Thanos, and gain insights into effective configuration practices for target discovery, recording rules, and alerting rules. Delve into the experiences and lessons learned by Alibaba's team in developing a robust monitoring infrastructure for their cluster management system.

Syllabus

High Available + Scalable Prometheus with Thanos in Alibaba - Guo'an Qin, Alibaba & Tao Li, Alibaba


Taught by

Linux Foundation

Tags

Related Courses

Optimizing Microsoft Windows Server Storage
Microsoft via edX
High Availability and Disaster Recovery with the SAP HANA Platform
SAP Learning
Microsoft Exchange Server 2016 - 3: Mailbox Databases
Microsoft via edX
Microsoft SharePoint 2016: Workload Optimization
Microsoft via edX
Microsoft Azure Virtual Machines
Microsoft via edX