TiDedup - A New Distributed Deduplication Architecture for Ceph
Offered By: USENIX via YouTube
Course Description
Overview
Explore a groundbreaking conference talk on TiDedup, a novel cluster-level deduplication architecture for Ceph, presented at USENIX ATC '23. Delve into the innovative solutions addressing key shortcomings in Ceph's existing deduplication design, including excessive metadata consumption, serialized tiering mechanism limitations, and inefficient reference count mechanisms. Discover three pioneering schemes introduced by TiDedup: selective cluster-level crawling, an event-driven tiering mechanism with content-defined chunking, and a reference correction method using shared reference back pointers. Learn about the successful integration of TiDedup into the Ceph mainline and its impressive performance results, showcasing up to 34% data reduction on real-world workloads, 50% improvement in foreground I/O throughput during deduplication, and a significant reduction in reference correction scan time by over 50%. Gain valuable insights into this cutting-edge distributed storage system enhancement presented by experts from Samsung Electronics, IBM, Ceph Foundation, and Seoul National University.
Syllabus
USENIX ATC '23 - TiDedup: A New Distributed Deduplication Architecture for Ceph4
Taught by
USENIX
Related Courses
Observing and Analysing Performance in SportOpenLearning Introduction aux réseaux mobiles
Institut Mines-Télécom via France Université Numerique Claves para Gestionar Personas
IESE Business School via Coursera الأجهزة الطبية في غرف العمليات والعناية المركزة
Rwaq (رواق) Clinical Supervision with Confidence
University of East Anglia via FutureLearn