Automating Disaster Recovery: The Ultimate Reliability Challenge
Offered By: USENIX via YouTube
Course Description
Overview
Explore a groundbreaking approach to disaster recovery automation in this 25-minute conference talk from SREcon24 Americas. Delve into the journey of a Cisco Systems Inc. team as they tackle the ultimate reliability challenge: automating response to catastrophic events like a meteor strike on a data center. Learn how the team overcame initial skepticism to successfully implement a fully automated disaster recovery system. Gain insights into the importance of adopting a sociotechnical perspective when addressing complex, wide-surface problems in IT infrastructure. Discover how this innovative approach can prepare organizations to handle unforeseen challenges and enhance overall system reliability. Walk away with valuable lessons on pushing the boundaries of what's possible in disaster recovery and site reliability engineering.
Syllabus
SREcon24 Americas - Automating Disaster Recovery: The Ultimate Reliability Challenge
Taught by
USENIX
Related Courses
Introduction to FinanceUniversity of Michigan via Coursera Information Security and Risk Management in Context
University of Washington via Coursera Financial Engineering and Risk Management
Columbia University via Coursera Building an Information Risk Management Toolkit
University of Washington via Coursera Caries Management by Risk Assessment (CAMBRA)
University of California, San Francisco via Coursera