YoVDO

Cattle Not Pets - Investigating Failed Nodes Before Deletion

Offered By: Linux Foundation via YouTube

Tags

Kubernetes Courses, Root Cause Analysis Courses

Course Description

Overview

Explore the nuanced approach to handling Kubernetes node failures in this conference talk. Learn why immediate deletion of failed nodes may hinder root cause analysis and prevention of future issues. Discover an alternative strategy that balances proper failover with the need for thorough investigation. Examine how existing projects handle node failures and gain insights into a proposed implementation leveraging External Remediation in MachineHealthCheck and fencing technologies like fence_kdump. Understand the importance of preserving failed nodes as valuable sources of information for engineers investigating system issues in cloud native environments.

Syllabus

Cattle Not Pets, but Don't Delete It Until Investigated - Masaki Kimura & Keisuke Saito, Hitachi


Taught by

Linux Foundation

Related Courses

Fixing Healthcare Delivery
University of Florida via Coursera
Effective Problem-Solving and Decision-Making
University of California, Irvine via Coursera
Process Improvement
University of Illinois at Urbana-Champaign via Coursera
Problem-Solving and Decision-Making Skills
Edraak
Six Sigma Part 2: Analyze, Improve, Control
Technische Universität München (Technical University of Munich) via edX