Planning for and Handling Failures - From Open Hardware and Aviation to Production at Google
Offered By: linux.conf.au via YouTube
Course Description
Overview
Explore a comprehensive analysis of failure management across diverse fields in this 46-minute conference talk from linux.conf.au. Delve into real-world examples from open hardware, aviation, and Google's production environment to gain valuable insights on anticipating, preventing, and learning from failures. Discover practical strategies for developing a keen sense for potential issues, implementing effective procedures, and conducting thorough root cause analyses. Learn from critical incidents in aviation, such as AF447 and QF32, and understand the implications of automation gone wrong. Gain knowledge on avoiding hardware mishaps, improving software development practices, and the importance of proper postmortems. This talk equips you with essential skills to enhance your approach to risk management and failure prevention across various technological domains.
Syllabus
Intro
Managing failures
Eusebio
Be mindful
Hardware
Phone
Spare to spare
Software
Code Reviews
Change Requests
Unit tests
Continuous integration
File updates
Postmortems
Practicing emergencies
Have backups be careful
Disk Erase
Rate Limits
Postmortem
Personal Lessons
Aviation Lessons
Risk Management
Post Mortem
Automation
Selfdriving cars
Air France 447
Airbus QF32
Indonesia
Aircraft accident
Boeing
Certification
Make a difference
Conclusions
QA
Taught by
linux.conf.au
Related Courses
6.S094: Deep Learning for Self-Driving CarsMassachusetts Institute of Technology via Independent An Introduction to Practical Deep Learning
Intel via Coursera Self-Driving Fundamentals: Featuring Apollo
Baidu via Udacity Self-Driving Cars Teach-Out
University of Michigan via Coursera Visual Perception for Self-Driving Cars
University of Toronto via Coursera