YoVDO

Microsecond Consensus for Microsecond Applications

Offered By: USENIX via YouTube

Tags

OSDI (Operating Systems Design and Implementation) Courses Fault Tolerance Courses Garbage Collection Courses Strong Consistency Courses RDMA Courses

Course Description

Overview

Explore a groundbreaking approach to fault-tolerant replication for microsecond-scale applications in this 20-minute conference talk from OSDI '20. Dive into the Mu system, which achieves replication in less than 1.3 microseconds and fail-over in under a millisecond, significantly outperforming previous systems. Learn how Mu leverages RDMA to implement state machine replication with strong consistency, addressing challenges such as concurrent leaders and log garbage collection. Discover the system's performance benefits when applied to various applications, including a financial exchange app, Redis, Memcached, and HERD. Gain insights into the innovative combination of RDMA permissions and distributed algorithmic design that enables Mu's exceptional speed and efficiency in replicating microsecond-scale applications.

Syllabus

Intro
Does Consensus Have to Be Slow?
Why Care About Microsecond Consensus?
Outline
Background: RDMA
Background: State Machine Replication
Mu Roles
Mu Common Case Replication
Common Case Replication: Intuition
Mu Background Plane
Mu Failure Detection
Evaluation: Setup
Evaluation: Replication Latency
Evaluation: Fail-over time
Conclusion


Taught by

USENIX

Related Courses

Creating and Administering Google Cloud Spanner Instances
Pluralsight
Architectural Patterns of Resilient Distributed Systems
Strange Loop Conference via YouTube
HTMFS - Strong Consistency Comes for Free with Hardware Transactional Memory in Persistent Memory File Systems
USENIX via YouTube
UniStore - A Fault-Tolerant Marriage of Causal and Strong Consistency
USENIX via YouTube
Spanner - Google's Distributed Database
Strange Loop Conference via YouTube