YoVDO

Microsecond Consensus for Microsecond Applications

Offered By: USENIX via YouTube

Tags

OSDI (Operating Systems Design and Implementation) Courses Fault Tolerance Courses Garbage Collection Courses Strong Consistency Courses RDMA Courses

Course Description

Overview

Explore a groundbreaking approach to fault-tolerant replication for microsecond-scale applications in this 20-minute conference talk from OSDI '20. Dive into the Mu system, which achieves replication in less than 1.3 microseconds and fail-over in under a millisecond, significantly outperforming previous systems. Learn how Mu leverages RDMA to implement state machine replication with strong consistency, addressing challenges such as concurrent leaders and log garbage collection. Discover the system's performance benefits when applied to various applications, including a financial exchange app, Redis, Memcached, and HERD. Gain insights into the innovative combination of RDMA permissions and distributed algorithmic design that enables Mu's exceptional speed and efficiency in replicating microsecond-scale applications.

Syllabus

Intro
Does Consensus Have to Be Slow?
Why Care About Microsecond Consensus?
Outline
Background: RDMA
Background: State Machine Replication
Mu Roles
Mu Common Case Replication
Common Case Replication: Intuition
Mu Background Plane
Mu Failure Detection
Evaluation: Setup
Evaluation: Replication Latency
Evaluation: Fail-over time
Conclusion


Taught by

USENIX

Related Courses

Windows Server 2019: Advanced Networking Features
LinkedIn Learning
Accelerating Serverless AI Large Model Inference with Functionalized Scheduling and RDMA
CNCF [Cloud Native Computing Foundation] via YouTube
Database Consolidation With Persistent Memory
Oracle via YouTube
Deep Dive into GPU Support in Apache Spark 3.x - Accelerator-Aware Scheduling and RAPIDS Plugin
Databricks via YouTube
eBPF Strengthens SR-IOV for Powerful Kubernetes Networking
CNCF [Cloud Native Computing Foundation] via YouTube