Site Reliability Engineering (SRE): The Big Picture
Offered By: Pluralsight
Course Description
Overview
SRE is how Google runs production systems, promoting high availability
with high velocity and removing operational toil. It achieves the same
goals as DevOps without the culture shift, so it's a better option for many
digital transformations.
Site Reliability Engineering (SRE) is a set of principles and practices that supports software delivery: keeping production systems stable while still delivering new features at speed. In this course, Site Reliability Engineering (SRE): The Big Picture, you'll get a thorough overview of how SRE works and why it's a good choice for many organizations. First, you'll learn the differences between SRE, DevOps, and traditional operations. Next, you'll discover how engineering practices help to reduce toil and provide more time to focus on high-value tasks. Finally, you'll learn how SRE approaches monitoring and alerting, and how it manages incidents. When you're finished with this course, you'll be able to evaluate SRE and see if it's a good fit for your organization.
Syllabus
- Course Overview (2 mins)
- Introducing Site Reliability Engineering (27 mins)
- Automation and Eliminating Toil (30 mins)
- Service Levels, Monitoring, and Alerting (28 mins)
- Incident Management: On-call and Postmortems (22 mins)
Taught by
Elton Stoneman
Related Courses
- Startup Engineering (Stanford University via Coursera)
- Developing Scalable Apps in Java (Google via Udacity)
- Cloud Computing Concepts, Part 1 (University of Illinois at Urbana-Champaign via Coursera)
- Cloud Networking (University of Illinois at Urbana-Champaign via Coursera)
- Cloud Computing Concepts: Part 2 (University of Illinois at Urbana-Champaign via Coursera)