Observability and the Future of Complex Systems
Offered By: ChariotSolutions via YouTube
Course Description
Overview
Syllabus
Intro
And the problem space is complex.
Write workload, trailing year
Read workload, trailing year
Service Level Objectives (SLO)
Data storage engine and analytics flow
SLOs are user flows
Service-Level Objectives
Functional and visual testing.
Design for feature flag deployment.
Automated integration & human review.
Green button merge.
Auto-updates, rollbacks, & pins.
Observe behavior in prod.
Non-trivial savings.
Three case studies of failure
1 Shepherd: ingest API service
Honeycomb Ingest Outage
Now what?
Kafka: data bus
Our month of Kafka pain
Unexpected constraints
Take care of your people
Optimize for safety
Retriever: query service
Making progress carefully
Takeaways
Acknowledge hidden risks
Make experimentation routine!
Understand & control production.
Taught by
ChariotSolutions
Related Courses
Developing a Google SRE Culture - Português BrasileiroGoogle Cloud via Coursera Developing a Google SRE Culture - Español
Google Cloud via Coursera Google Cloud Customer Care Fundamentals
Google Cloud via Coursera Developing a Google SRE Culture - Locales
Google via Google Cloud Skills Boost Developing a Google SRE Culture
Google Cloud via Coursera