Managing Chaos at Scale
Offered By: WeAreDevelopers via YouTube
Course Description
Overview
Explore Uber's journey in maintaining reliability during explosive growth from a few to thousands of microservices in this 45-minute conference talk by Paweł Królikowski. Dive into incident prevention strategies, including integration testing, load testing, chaos testing, blackbox testing, and rollout strategies. Learn about effective incident response techniques, covering on-call procedures, monitoring systems, alerting mechanisms, and mitigation strategies. Gain insights into the benefits of using common frameworks in reliability engineering. Discover what Uber did right and the valuable lessons learned through experience in managing chaos at scale.
Syllabus
Managing Chaos at Scale
Taught by
WeAreDevelopers
Related Courses
Implementing SRE in a Regulated EnvironmentUSENIX via YouTube Testing - Is This Thing On(line)? Meet Your New Microsoft Testing Tools
NDC Conferences via YouTube How to Do In-App Chaos Testing
NDC Conferences via YouTube Introduction to Cloud Native Chaos Engineering
Kunal Kushwaha via YouTube Case Study - Improving Resilience of Applications in Telco Environments
CNCF [Cloud Native Computing Foundation] via YouTube