Challenges of Starting an SRE Team from Scratch in an Enterprise
Offered By: USENIX via YouTube
Course Description
Overview
Explore the challenges and strategies of establishing a Site Reliability Engineering (SRE) team within a large enterprise in this 33-minute conference talk from SREcon20 Americas. Discover how BT approached implementing SRE principles by first addressing critical business concerns such as security, cloud sprawl, and cost control. Learn about the complexities of creating an SRE team beyond simply renaming an existing operations team or copying Google's model. Gain insights into the journey of building an SRE team, including the main obstacles faced in a corporate environment, key learnings, and valuable advice for newly formed SRE teams. Understand the importance of tailoring SRE practices to specific business needs and establishing unique standards. Follow the structured presentation as it covers topics like cloud computing standards, chaos engineering, and integrating SRE into engineering processes.
Syllabus
Introduction
Presentation Overview
BT Background
Waynes Background
Waynes Story
Where to Start
Challenges
Management
The Solution
Goals
Planning and Organizing
Cloud Sprawl
Cloud Computing Standards
Chaos Engineering
SRE at BT
SRE in Engineering
Summary
Conclusion
Taught by
USENIX
Related Courses
How to Not Destroy Your Production Kubernetes ClustersUSENIX via YouTube SRE and ML - Why It Matters
USENIX via YouTube Knowledge and Power - A Sociotechnical Systems Discussion on the Future of SRE
USENIX via YouTube Tracing Bare Metal with OpenTelemetry
USENIX via YouTube Improving How We Observe Our Observability Data - Techniques for SREs
USENIX via YouTube