YoVDO

Architecting Resilience: Lessons from Managing 7000+ Kubernetes Clusters at Scale

Offered By: CNCF [Cloud Native Computing Foundation] via YouTube

Tags

Kubernetes Courses DevOps Courses Distributed Systems Courses Disaster Recovery Courses Scalability Courses High Availability Courses Cluster Management Courses Infrastructure Management Courses Cloud-Native Architecture Courses Resilience Engineering Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the challenges and solutions in managing over 7,000 Kubernetes clusters at scale in this conference talk from KubeCon + CloudNativeCon. Gain insights into architecting resilient systems as Kakao's private Kubernetes as a Service team members share their experiences following a significant data center fire. Learn about the economic and social impacts of the incident, and discover the team's approach to providing highly available Kubernetes clusters efficiently for developers. Delve into design ideas for cluster high-availability, implementation challenges, and concerns encountered while managing a vast infrastructure of 100,000+ nodes. Understand the importance of resilience in cloud-native environments and how to apply these lessons to your own Kubernetes deployments.

Syllabus

Architecting Resilience: Lessons from Managing 7K+ Kubernetes Clusters at Scale


Taught by

CNCF [Cloud Native Computing Foundation]

Related Courses

DevOps Foundations: Effective Postmortems
LinkedIn Learning
Identifying Hidden Dependencies
USENIX via YouTube
When -bin-sh Attacks - Revisiting "Automate All the Things"
USENIX via YouTube
Fault Tree Analysis Applied to Apache Kafka
USENIX via YouTube
Introduction to Chaos Engineering With LitmusChaos
Kunal Kushwaha via YouTube