Challenges of Starting an SRE Team from Scratch in an Enterprise
Offered By: USENIX via YouTube
Course Description
Overview
Explore the challenges and strategies of establishing a Site Reliability Engineering (SRE) team within a large enterprise in this 33-minute conference talk from SREcon20 Americas. Discover how BT approached implementing SRE principles by first addressing critical business concerns such as security, cloud sprawl, and cost control. Learn about the complexities of creating an SRE team beyond simply renaming an existing operations team or copying Google's model. Gain insights into the journey of building an SRE team, including the main obstacles faced in a corporate environment, key learnings, and valuable advice for newly formed SRE teams. Understand the importance of tailoring SRE practices to specific business needs and establishing unique standards. Follow the structured presentation as it covers topics like cloud computing standards, chaos engineering, and integrating SRE into engineering processes.
Syllabus
Introduction
Presentation Overview
BT Background
Waynes Background
Waynes Story
Where to Start
Challenges
Management
The Solution
Goals
Planning and Organizing
Cloud Sprawl
Cloud Computing Standards
Chaos Engineering
SRE at BT
SRE in Engineering
Summary
Conclusion
Taught by
USENIX
Related Courses
DevOps Foundations: Chaos EngineeringLinkedIn Learning Practical Chaos Engineering - Breaking Things on Purpose to Make Them More Resilient Against Failure
NDC Conferences via YouTube Patterns for Resilient Architecture
NDC Conferences via YouTube Antics, Drift, and Chaos
Strange Loop Conference via YouTube The Smallest Possible SRE Team
USENIX via YouTube