Disaster Recovery: Rebuilding Production in Under 1 Hour Using KOps, ArgoCD and Velero
Offered By: CNCF [Cloud Native Computing Foundation] via YouTube
Course Description
Overview
Explore a real-life disaster recovery scenario in this conference talk that details how a production environment was rebuilt from scratch in under an hour using KOps, ArgoCD, and Velero. Learn about the operational incident caused by misconfiguration, the challenges faced when standard backup and recovery methods failed, and the crucial role of GitOps and infrastructure as code in the recovery process. Discover the unexpected issues encountered during the 51-minute cluster recreation, including tool malfunctions and outdated disaster recovery guides. Gain insights into the workarounds employed, post-incident improvements, and valuable lessons learned. Understand the importance of disaster recovery planning, the benefits of migrating to GitOps and ArgoCD, and how to streamline deployment processes for faster recovery times.
Syllabus
Introduction
Welcome
About me
About Ada
The Incident
Cluster Management Tool
Disaster Recovery Planning
What is Valero
Making a Decision
Argo CD
Total Outage
The Problem
What Did We Learn
Moving to Argo CD
Single Workflow
Deployment Process
Demo
Questions
Git Repo
Taught by
CNCF [Cloud Native Computing Foundation]
Related Courses
Hands-on Introduction to Velero - Backup and Restore for KubernetesRawkode Academy via YouTube Velero - The Cloud Native Backup for Kubernetes
Linux Foundation via YouTube Velero - Open Source Kubernetes Backup and Recovery Tool
CNCF [Cloud Native Computing Foundation] via YouTube Navigating Disaster Recovery in Kubernetes and Crossplane - Strategies and Tools for Managing Stateful Workloads
CNCF [Cloud Native Computing Foundation] via YouTube From Pre-Population to Disasters: Managing and Protecting VM State in KubeVirt
CNCF [Cloud Native Computing Foundation] via YouTube