Meatbag Systems - How Our Reliability Culture & Practice Evolved over Time
Offered By: USENIX via YouTube
Course Description
Overview
Explore the evolution of reliability culture and practices in a 24-minute conference talk from SREcon22 EMEA. Follow Zalando's journey toward better reliability, with a focus on the human aspects of maintaining production systems. Examine three major incidents that shaped the organization's approach to reliability, and learn how the organization developed a common language, culture, and support network to enhance operational maturity. Gain insights into aligning reliability efforts with customer value and creating a shared understanding of "sufficiently reliable" systems. Discover how the interplay between technical systems and the people who maintain them impacts overall reliability and customer satisfaction.
Syllabus
SREcon22 EMEA - Meatbag Systems: How Our Reliability Culture & Practice Evolved over Time
Taught by
USENIX
Related Courses
How to Not Destroy Your Production Kubernetes Clusters (USENIX via YouTube)
SRE and ML - Why It Matters (USENIX via YouTube)
Knowledge and Power - A Sociotechnical Systems Discussion on the Future of SRE (USENIX via YouTube)
Tracing Bare Metal with OpenTelemetry (USENIX via YouTube)
Improving How We Observe Our Observability Data - Techniques for SREs (USENIX via YouTube)