Zonal Outage Operational Stories in Kubernetes Clusters
Offered By: CNCF [Cloud Native Computing Foundation] via YouTube
Course Description
Overview
Explore real-world zonal outage scenarios and their impact on Kubernetes clusters in this insightful conference talk by Jyoti Ranjan Mahapatra and Shyam Jeedigunta from Amazon Web Services. Gain valuable insights into the concept of availability zones as failure domains and learn how Kubernetes cluster administrators deploy components to achieve high availability. Discover the behavior of Kubernetes components during various types of zonal failures, from partial to full outages, including network partitions, power loss, reboots, and software deployment issues. Benefit from the speakers' extensive experience in operating a large fleet of Kubernetes control planes in AWS as they share operational stories and improvements that have enhanced resiliency for thousands of clusters. Understand the importance of topological spread in tolerating single fault domain failures and ensuring graceful handling of common zonal issues.
Syllabus
Zonal Outage Operational Stories - Jyoti Ranjan Mahapatra & Shyam Jeedigunta, Amazon Web Services
Taught by
CNCF [Cloud Native Computing Foundation]
Related Courses
Emergency ManagementOpen2Study Resilience in Children Exposed to Trauma, Disaster and War: Global Perspectives
University of Minnesota via Coursera MongoDB Advanced Deployment and Operations
MongoDB University Arch403: Designing Resilient Schools
Build Academy via EdCast Bases de données relationnelles : Comprendre pour maîtriser
Inria (French Institute for Research in Computer Science and Automation) via France Université Numerique