Human Observability of Incident Response
Offered By: USENIX via YouTube
Course Description
Overview
Explore the critical aspects of human observability in incident response through this 39-minute conference talk from SREcon23 Americas. Delve into the interconnected nature of socio-technical systems and learn how understanding human factors can lead to more effective incident management. Discover practical advice for producing consistently better outcomes by focusing on the human element in incident response. Gain insights from Matt Davis of FORM.com on how learning from incidents also involves learning from and about team members. Understand the importance of coordinating responses with a deep comprehension of both technical systems and human dynamics. Access the full range of SREcon23 Americas Technical Sessions for a comprehensive look at current trends and best practices in site reliability engineering.
Syllabus
SREcon23 Americas - Human Observability of Incident Response
Taught by
USENIX
Related Courses
How to Not Destroy Your Production Kubernetes Clusters
USENIX via YouTube
SRE and ML - Why It Matters
USENIX via YouTube
Knowledge and Power - A Sociotechnical Systems Discussion on the Future of SRE
USENIX via YouTube
Tracing Bare Metal with OpenTelemetry
USENIX via YouTube
Improving How We Observe Our Observability Data - Techniques for SREs
USENIX via YouTube