YoVDO

Using Chaos Engineering to Ensure Kubernetes Reliability

Offered By: Linux Foundation via YouTube

Tags

Kubernetes Courses Chaos Engineering Courses

Course Description

Overview

Explore chaos engineering techniques to enhance Kubernetes reliability in this 46-minute Linux Foundation webinar sponsored by Gremlin. Discover how to leverage Kubernetes's resiliency features and achieve reliability goals while addressing the increased complexity of dynamic environments. Learn about Kubernetes from both operations and developer perspectives, understand the scientific approach to chaos engineering, and delve into topics such as multizone clusters, node problem detection, front-end chaos engineering, network issues, and horizontal pod scaling. Gain insights on automation, resources, tools, and the importance of controlled testing over random crashing. Explore the intersection of chaos engineering with machine learning algorithms to ensure robust and reliable Kubernetes deployments.

Syllabus

Introduction
Meet Mara
Kubernetes from an Ops perspective
Kubernetes from a Developer perspective
Kubernetes complexity
What is chaos engineering
Chaos engineering is science
Observation
Safety
Kubernetes
Master Nodes
Multizone Cluster
Node Problem Detector
Front End Chaos Engineering
Network Issues
Horizontal Pod Scaling
Recap
Automation
Resources
Tools
Why is random crashing bad
Machine learning algorithms


Taught by

Linux Foundation

Tags

Related Courses

DevOps Foundations: Chaos Engineering
LinkedIn Learning
Practical Chaos Engineering - Breaking Things on Purpose to Make Them More Resilient Against Failure
NDC Conferences via YouTube
Patterns for Resilient Architecture
NDC Conferences via YouTube
Antics, Drift, and Chaos
Strange Loop Conference via YouTube
Challenges of Starting an SRE Team from Scratch in an Enterprise
USENIX via YouTube