YoVDO

Physical vs. Logical Indexing with IDEA - Inverted Deduplication-Aware Index

Offered By: USENIX via YouTube

Tags

Information Retrieval Courses Data Structures Courses Search Algorithms Courses

Course Description

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a groundbreaking approach to term-indexing in deduplicated data systems through this 26-minute conference talk from FAST '24. Dive into the challenges of maintaining efficient term-indexing in the face of growing online data and widespread use of data deduplication in storage systems. Learn about IDEA (Inverted Deduplication-Aware Index), a novel design that maps terms to unique data chunks and chunks to containing files, addressing inefficiencies in traditional indexing methods. Discover how this approach can significantly reduce index size, indexing time, and term-lookup latency while supporting advanced functionalities like inline indexing, result ranking, and proximity search. Gain insights from a prototype implementation based on Lucene, demonstrating substantial improvements in various performance metrics compared to conventional indexing techniques.

Syllabus

FAST '24 - Physical vs. Logical Indexing with IDEA: Inverted Deduplication-Aware Index


Taught by

USENIX

Related Courses

Design and Analysis of Algorithms
Chennai Mathematical Institute via Swayam
How to Win Coding Competitions: Secrets of Champions
ITMO University via edX
Artificial Intelligence
Georgia Institute of Technology via Udacity
Introdução à Ciência da Computação com Python Parte 2
Universidade de São Paulo via Coursera
Introducción a la programación en Java: empezando a programar
Universidad Carlos iii de Madrid via edX