One on One SRE
Offered By: USENIX via YouTube
Course Description
Overview
Syllabus
Intro
trauma: extreme stress that overwhelms a person's ability to cope
insufficient guard rails
unknown unknowns
The 1:1 Incident Debrief
informed consent
what was your role in the incident?
how long did you work on the incident?
were you able to get the support you needed?
do you feel that the incident was preventable?
what actions do you feel good about?
what do you think could have been better?
what did you learn from this incident?
what do you think we can do to prevent reoccurrence?
did our tools and documentation serve you well?
did you practice self-care during this process?
can you think of anyone else we should talk to?
spanning tree
deviant behavior
How can I, an individual contributor, impact reliability at organizational scale?
Taught by
USENIX
Related Courses
How to Not Destroy Your Production Kubernetes ClustersUSENIX via YouTube SRE and ML - Why It Matters
USENIX via YouTube Knowledge and Power - A Sociotechnical Systems Discussion on the Future of SRE
USENIX via YouTube Tracing Bare Metal with OpenTelemetry
USENIX via YouTube Improving How We Observe Our Observability Data - Techniques for SREs
USENIX via YouTube