Managing Chaos at Scale
Offered By: WeAreDevelopers via YouTube
Course Description
Overview
Explore Uber's journey in maintaining reliability during explosive growth from a few to thousands of microservices in this 45-minute conference talk by Paweł Królikowski. Dive into incident prevention strategies, including integration testing, load testing, chaos testing, blackbox testing, and rollout strategies. Learn about effective incident response techniques, covering on-call procedures, monitoring systems, alerting mechanisms, and mitigation strategies. Gain insights into the benefits of using common frameworks in reliability engineering. Discover what Uber did right and the valuable lessons learned through experience in managing chaos at scale.
Syllabus
Managing Chaos at Scale
Taught by
WeAreDevelopers
Related Courses
Information Security Management in a NutshellSAP Learning Identifying, Monitoring, and Analyzing Risk and Incident Response and Recovery
(ISC)² via Coursera Enterprise Security Fundamentals
Microsoft via edX Planning a Security Incident Response
Microsoft via edX Introduction to Cybersecurity
Udacity