Cattle Not Pets - Investigating Failed Nodes Before Deletion
Offered By: Linux Foundation via YouTube
Course Description
Overview
Explore the nuanced approach to handling Kubernetes node failures in this conference talk. Learn why immediate deletion of failed nodes may hinder root cause analysis and prevention of future issues. Discover an alternative strategy that balances proper failover with the need for thorough investigation. Examine how existing projects handle node failures and gain insights into a proposed implementation leveraging External Remediation in MachineHealthCheck and fencing technologies like fence_kdump. Understand the importance of preserving failed nodes as valuable sources of information for engineers investigating system issues in cloud native environments.
Syllabus
Cattle Not Pets, but Don't Delete It Until Investigated - Masaki Kimura & Keisuke Saito, Hitachi
Taught by
Linux Foundation
Tags
Related Courses
Fixing Healthcare DeliveryUniversity of Florida via Coursera Effective Problem-Solving and Decision-Making
University of California, Irvine via Coursera Process Improvement
University of Illinois at Urbana-Champaign via Coursera مهارات حل المشكلات واتخاذ القرارات
Edraak Six Sigma Part 2: Analyze, Improve, Control
Technische Universität München (Technical University of Munich) via edX