YoVDO

From Nothing to SRE - Practical Guidance

Offered By: USENIX via YouTube

Tags

SREcon Courses Risk Management Courses Organizational Culture Courses

Course Description

Overview

Explore practical guidance on implementing Site Reliability Engineering (SRE) in smaller organizations through this 26-minute conference talk from SREcon19 Europe/Middle East/Africa. Delve into the journey of building an SRE team from scratch, addressing unique challenges faced by tech teams with 50 or fewer engineers. Learn how to gain buy-in for SRE, foster a culture of continual experimentation, and navigate potential blindspots. Discover strategies for managing on-call responsibilities, aligning incentives, and adapting SRE principles to smaller-scale operations. Gain insights on balancing risk management, avoiding burnout, and cultivating operational excellence in resource-constrained environments. Understand the relevance of SRE for organizations of all sizes and how to overcome common obstacles in its adoption.

Syllabus

Intro
Where do I start
Oncall
Why SRE is relevant
Common challenges
IT Software Engineering
Complexity
Risk
Where do we start
Culture
The Conventional Way
The Alternative Way
The Handoff Word
Ryan Kitchens Netflix
Reliability
Incentives
Gates
Carrot not the stick
SRE is a force multiplier
How users feel matters
Unsolved problems
Architecture
SRE Community
Questions


Taught by

USENIX

Related Courses

How to Not Destroy Your Production Kubernetes Clusters
USENIX via YouTube
SRE and ML - Why It Matters
USENIX via YouTube
Knowledge and Power - A Sociotechnical Systems Discussion on the Future of SRE
USENIX via YouTube
Tracing Bare Metal with OpenTelemetry
USENIX via YouTube
Improving How We Observe Our Observability Data - Techniques for SREs
USENIX via YouTube