Rolling out Error Budgets Across a 1000 Person Global Engineering Org
Offered By: GOTO Conferences via YouTube
Course Description
Overview
Explore how Zendesk implemented Site Reliability Engineering (SRE) concepts, specifically Error Budgets and SLOs/SLIs, across its global engineering organization of 1000 people. Learn about the challenges faced in addressing major outages, the impact of company-wide change freezes, and the journey towards improving reliability. Gain practical insights into tooling and practices for implementing Error Budgets, as well as strategies for scoping freezes to only the systems with more reliability issues. Discover the wins and ongoing challenges in this 32-minute conference talk from YOW! 2019, presented by John Viner, Senior Director of Engineering at Zendesk.
Syllabus
Rolling out Error Budgets Across a 1000 Person Global Engineering Org. • John Viner • YOW! 2019
Taught by
GOTO Conferences
Related Courses
DevOps Foundations: Chaos Engineering (LinkedIn Learning)
Practical Chaos Engineering - Breaking Things on Purpose to Make Them More Resilient Against Failure (NDC Conferences via YouTube)
Patterns for Resilient Architecture (NDC Conferences via YouTube)
Antics, Drift, and Chaos (Strange Loop Conference via YouTube)
Challenges of Starting an SRE Team from Scratch in an Enterprise (USENIX via YouTube)