Fast Recovery of Container Failures at Large Scale - The 1-5-10 Approach
Offered By: Linux Foundation via YouTube
Course Description
Overview
Explore Alibaba's innovative 1-5-10 theory for fast container failure recovery at scale in this informative conference talk. Delve into the challenges of maintaining container reliability in the cloud era as applications grow rapidly. Learn how to detect problems within 1 minute, identify issues within 5 minutes, and resolve failures within 10 minutes. Discover techniques for building an efficient local agent for quick problem detection, implementing intelligent diagnostics using expert knowledge bases, and automating container problem recovery through a failure-driven approach. Gain valuable insights into increasing the reliability of large-scale container deployments without increasing resource investment.
Syllabus
1-5-10: How to Fast Recover Container Failure at Large Scale - XiongHuan, Alibaba
Taught by
Linux Foundation
Tags
Related Courses
Startup EngineeringStanford University via Coursera Developing Scalable Apps in Java
Google via Udacity Cloud Computing Concepts, Part 1
University of Illinois at Urbana-Champaign via Coursera Cloud Networking
University of Illinois at Urbana-Champaign via Coursera Cloud Computing Concepts: Part 2
University of Illinois at Urbana-Champaign via Coursera