YoVDO

Cluster Golden Signals: Avoiding Alert Fatigue at Scale

Offered By: Linux Foundation via YouTube

Tags

Kubernetes Courses DevOps Courses Incident Response Courses Anomaly Detection Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a comprehensive approach to managing Kubernetes cluster metrics and alerts in this 46-minute conference talk by Anusha Ragunathan and Sahil Badla from Intuit Inc. Learn how to apply the industry-standard "Golden Signals" concept to Kubernetes clusters, effectively reducing alert fatigue for platform engineers and SREs. Discover the architecture and components of a successful metrics pipeline that derives baseline behaviors and detects anomalies. Through a simulated incident demonstration, understand how cluster golden signals can differentiate between service and platform issues, enabling efficient incident isolation and remediation. Gain valuable insights and best practices for implementing this system at scale, based on real-world production experience.

Syllabus

Cluster Golden Signals to Avoid Alert Fatigue at Scale - Anusha Ragunathan & Sahil Badla, Intuit Inc


Taught by

Linux Foundation

Tags

Related Courses

Startup Engineering
Stanford University via Coursera
Developing Scalable Apps in Java
Google via Udacity
Cloud Computing Concepts, Part 1
University of Illinois at Urbana-Champaign via Coursera
Cloud Networking
University of Illinois at Urbana-Champaign via Coursera
Cloud Computing Concepts: Part 2
University of Illinois at Urbana-Champaign via Coursera