Why Are Distributed Systems So Hard?
Offered By: USENIX via YouTube
Course Description
Overview
Syllabus
Introduction
Agenda
Storytime
Data Evolution
Scaling
Cloud Computing
Why Scale Horizontally
What Does It Mean To Run A Distributed System
A Note On Distributed Computing
Summary
Shared Nothing Architecture
Unreliable Message Delivery
Why Are We Fenced Off
Building Observability
What We Can Know
The CAP Theorem
C
Replication Lag
Consistency is a Spectrum
Availability is Not Binary
Partition Tolerance
Hardware
Hardware Failure
Cables
Sharks
Kevlar
Network Partitions
Resource Isolation
Process Suspension
Network Glitch
People Do Bad Things
Why Does This Matter
Practical Reality
The Correctness Result
Mitigation Strategies
Consensus Algorithms
The Woods Theorem
Building Mental Models
Incident Analysis
Blameless Discussions
Mental Models
Human Failure
Alert Fatigue
User Mindsets
Designing Systems for Humans
HugOps
Taught by
USENIX
Related Courses
Reliable Distributed Algorithms - Part 2 (KTH Royal Institute of Technology via edX)
Maîtrisez les bases de données NoSQL [Master NoSQL Databases] (CentraleSupélec via OpenClassrooms)
Big Data: adquisición y almacenamiento de datos [Big Data: Data Acquisition and Storage] (Universitat Autònoma de Barcelona (Autonomous University of Barcelona) via Coursera)
Introduction to NoSQL Databases (IBM via Coursera)
Recognize the Need for Document Databases (Pluralsight)