High Available and Scalable Prometheus with Thanos in Alibaba
Offered By: Linux Foundation via YouTube
Course Description
Overview
Explore how Alibaba Group uses Kubernetes, Prometheus, and Thanos to support its massive e-commerce operations in this conference talk. Discover the challenges and solutions involved in implementing a highly available and scalable fine-grained monitoring system capable of handling 4 million TPS and 10K requests per second. Learn about scaling Prometheus for large-scale scenarios and optimizing query latency across multiple Prometheus instances using Thanos, and gain insights into effective configuration practices for target discovery, recording rules, and alerting rules. Delve into the experiences and lessons learned by Alibaba's team in developing a robust monitoring infrastructure for their cluster management system.
Syllabus
High Available + Scalable Prometheus with Thanos in Alibaba - Guo'an Qin, Alibaba & Tao Li, Alibaba
Taught by
Linux Foundation
Related Courses
Financial Sustainability: The Numbers side of Social Enterprise - +Acumen via NovoEd
Cloud Computing Concepts: Part 2 - University of Illinois at Urbana-Champaign via Coursera
Developing Repeatable Models® to Scale Your Impact - +Acumen via Independent
Managing Microsoft Windows Server Active Directory Domain Services - Microsoft via edX
Introduction aux conteneurs - Microsoft Virtual Academy via OpenClassrooms