YoVDO

Hard Problems We Handle in Incidents but Aren't Recognized

Offered By: USENIX via YouTube

Tags

SREcon Courses Recruiting Courses

Course Description

Overview

Explore the often-overlooked challenges and dynamics in incident management during this 31-minute conference talk from SREcon21. Delve into the complexities of diagnostic work, recruiting, status reporting, and coordination costs. Examine the dilemmas and sacrifices made by incident responders, including the divide-and-conquer approach and decision-making under pressure. Gain insights into hindsight bias and parallel incident handling, while learning descriptive vocabulary to better recognize and address these hidden aspects in future incidents. Enhance your understanding of the intricate problem-solving processes involved in effective incident response.

Syllabus

Intro
Title
Diagnostic Work
Recruiting
Status Reporting
Costs of Coordination
The Dilemma
Divide Conquer
Sacrifice Decisions
hindsight bias
parallel incidents
chat transcript


Taught by

USENIX

Related Courses

How to Not Destroy Your Production Kubernetes Clusters
USENIX via YouTube
SRE and ML - Why It Matters
USENIX via YouTube
Knowledge and Power - A Sociotechnical Systems Discussion on the Future of SRE
USENIX via YouTube
Tracing Bare Metal with OpenTelemetry
USENIX via YouTube
Improving How We Observe Our Observability Data - Techniques for SREs
USENIX via YouTube