YoVDO

Fast Erasure Coding for Data Storage - A Comprehensive Study of the Acceleration Techniques

Offered By: USENIX via YouTube

Tags

FAST (File and Storage Technologies) Courses Erasure Coding Courses

Course Description

Overview

Explore a comprehensive study on accelerating erasure coding techniques for data storage in this USENIX FAST '19 conference talk. Delve into various optimization methods including bitmatrix design, computation scheduling, XOR operation reduction, caching management, and vectorization. Learn how these techniques, previously proposed individually, can be combined for maximum efficiency. Discover a simple design approach that optimizes bitmatrices using a weighted cost function, and understand a straightforward erasure coding procedure that outperforms many existing techniques. Compare the performance against well-known codes like EVENODD, RDP, and STAR. Gain insights into why vectorizing XOR operations may be preferable to vectorizing finite field operations, considering both encoding throughput and CPU migration efforts. Examine individual acceleration techniques, optimization tiers, and their combinations through extensive testing results.

Syllabus

Intro
Data Reliability
Replication - Erasure Code
Acceleration Techniques
Cauchy Reed-Solomon Codes
Fast GF Ops - Binary Representation
Techniques Tiers
Bitmatrix Normalization (BN)
Smart Scheduling (SS)
Scheduling - Cache Optimization (S-CO)
Vectorization
Question to Answer
Individual Techniques
Optimization Tiers
Combinations (ij)-strategy
Choice of Cauchy Matrix
Cost Function Improvement
Proposed Coding Procedure
Testing Setup
Encoding v.s. Efficient RS/CRS code
Encoding v.s. Three Parities Codes
Encoding v.s. Two Parities Codes
Overall Encoding Improvement
Decoding Throughput
Conclusion


Taught by

USENIX

Related Courses

CRaft - An Erasure-coding-supported Version of Raft for Reducing Storage Cost
USENIX via YouTube
Carbink - Fault-Tolerant Far Memory
USENIX via YouTube
Hydra - Resilient and Highly Available Remote Memory
USENIX via YouTube
First Principles: Using Redundancy and Recovery for High Durability in OCI Object Storage
Oracle via YouTube
InfiniCache - Exploiting Ephemeral Serverless Functions to Build a Cost-Effective Memory Cache
USENIX via YouTube