Planning for and Handling Failures - From Open Hardware and Aviation to Production at Google
Offered By: linux.conf.au via YouTube
Course Description
Overview
This 46-minute conference talk from linux.conf.au examines how failures are anticipated, prevented, and learned from in three fields: open hardware, aviation, and Google's production environment. It uses real-world examples to show how to develop a keen sense for potential issues, put effective procedures in place, and conduct thorough root cause analyses. Aviation incidents such as AF447 and QF32 illustrate the implications of automation gone wrong. The talk also covers avoiding hardware mishaps, improving software development practices, and the importance of proper postmortems, with lessons on risk management and failure prevention that apply across technological domains.
Syllabus
Intro
Managing failures
Eusebio
Be mindful
Hardware
Phone
Spare to spare
Software
Code Reviews
Change Requests
Unit tests
Continuous integration
File updates
Postmortems
Practicing emergencies
Have backups, be careful
Disk Erase
Rate Limits
Postmortem
Personal Lessons
Aviation Lessons
Risk Management
Postmortem
Automation
Self-driving cars
Air France 447
Airbus QF32
Indonesia
Aircraft accident
Boeing
Certification
Make a difference
Conclusions
Q&A
Taught by
linux.conf.au
Related Courses
Introduction to Finance - University of Michigan via Coursera
Information Security and Risk Management in Context - University of Washington via Coursera
Financial Engineering and Risk Management - Columbia University via Coursera
Building an Information Risk Management Toolkit - University of Washington via Coursera
Caries Management by Risk Assessment (CAMBRA) - University of California, San Francisco via Coursera