Cattle Not Pets - Investigating Failed Nodes Before Deletion
Offered By: Linux Foundation via YouTube
Course Description
Overview
Explore the nuanced approach to handling Kubernetes node failures in this conference talk. Learn why immediate deletion of failed nodes may hinder root cause analysis and prevention of future issues. Discover an alternative strategy that balances proper failover with the need for thorough investigation. Examine how existing projects handle node failures and gain insights into a proposed implementation leveraging External Remediation in MachineHealthCheck and fencing technologies like fence_kdump. Understand the importance of preserving failed nodes as valuable sources of information for engineers investigating system issues in cloud native environments.
Syllabus
Cattle Not Pets, but Don't Delete It Until Investigated - Masaki Kimura & Keisuke Saito, Hitachi
Taught by
Linux Foundation
Tags
Related Courses
Introduction to Cloud Infrastructure TechnologiesLinux Foundation via edX Scalable Microservices with Kubernetes
Google via Udacity Google Cloud Fundamentals: Core Infrastructure
Google via Coursera Introduction to Kubernetes
Linux Foundation via edX Fundamentals of Containers, Kubernetes, and Red Hat OpenShift
Red Hat via edX