YoVDO

Cattle Not Pets - Investigating Failed Nodes Before Deletion

Offered By: Linux Foundation via YouTube

Tags

Kubernetes Courses Root Cause Analysis Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the nuanced approach to handling Kubernetes node failures in this conference talk. Learn why immediate deletion of failed nodes may hinder root cause analysis and prevention of future issues. Discover an alternative strategy that balances proper failover with the need for thorough investigation. Examine how existing projects handle node failures and gain insights into a proposed implementation leveraging External Remediation in MachineHealthCheck and fencing technologies like fence_kdump. Understand the importance of preserving failed nodes as valuable sources of information for engineers investigating system issues in cloud native environments.

Syllabus

Cattle Not Pets, but Don't Delete It Until Investigated - Masaki Kimura & Keisuke Saito, Hitachi


Taught by

Linux Foundation

Tags

Related Courses

Introduction to Cloud Infrastructure Technologies
Linux Foundation via edX
Scalable Microservices with Kubernetes
Google via Udacity
Google Cloud Fundamentals: Core Infrastructure
Google via Coursera
Introduction to Kubernetes
Linux Foundation via edX
Fundamentals of Containers, Kubernetes, and Red Hat OpenShift
Red Hat via edX