Debugging Complex Kubernetes Incidents - When It's Not DNS

Offered By: CNCF [Cloud Native Computing Foundation] via YouTube

Tags

Kubernetes Courses, Routing Courses, Ingress Courses, VPC Flow Logs Courses

Course Description

Overview

This conference talk walks through the investigation of a complex Kubernetes incident: mysterious service errors during rolling updates that were initially suspected to be DNS-related. Follow the debugging steps, from analyzing application behavior and the DNS setup to investigating networking issues and VPC flow logs. The talk covers ingress and egress flows, routing on nodes, and the impact of reverse path filtering, then examines the RPC setup, DNS propagation time during rollouts, and reconnection differences. It closes with lessons learned from a challenging issue that was ultimately fixed by removing three lines of code.

Syllabus

Intro
Metrics service errors during rollouts
Applications involved
DNS setup
Too many queries at startup?
Networking issues?
Let's test with network optimized instances
What about bigger instances?
VPC Flow Logs
Zoom on ingress flows to old IP
What about egress?
Routing on nodes
Stable state
What about traffic to old IP?
Let's simulate
Reverse Path filtering
2 questions
RPC setup
DNS propagation time during rollouts
Reconnection differences
Lessons Learned


Taught by

CNCF [Cloud Native Computing Foundation]

Related Courses

Computer Networks
University of Washington via Coursera
Cloud Networking
University of Illinois at Urbana-Champaign via Coursera
Front End Frameworks
Google via Udacity
Build a Simple Dynamic Site with Node.js Course (How To)
Treehouse
VLSI Physical Design
Indian Institute of Technology, Kharagpur via Swayam