Cost-Effective Updating of Distributed Reordered Indexes
Offered By: Association for Computing Machinery (ACM) via YouTube
Course Description
Overview
Explore cost-effective strategies for updating distributed reordered indexes in this 22-minute conference talk. Delve into index reordering techniques that optimize document collection numbering, enhancing inverted index compression. Examine the challenges of maintaining effective reorderings as collections grow over time, particularly in distributed retrieval systems. Learn about methods for preserving and reinstating reorderings, backed by experimental results from a large English news article corpus. Gain insights into the impact of reordering on query execution time and consider various update operations, including batch append. Discover practical approaches to balance index efficiency and maintenance costs in evolving document collections.
Syllabus
Intro
Inverted Indexing
Document Reordering
Distributed Retrieval Systems
Update Operations
Questions to consider
Data and Experiments
Batch Append Operations
Plus, One More Thing
Taught by
Association for Computing Machinery (ACM)
Related Courses
Advanced Operating SystemsGeorgia Institute of Technology via Udacity High Performance Computing
Georgia Institute of Technology via Udacity GT - Refresher - Advanced OS
Georgia Institute of Technology via Udacity Distributed Machine Learning with Apache Spark
University of California, Berkeley via edX CS125x: Advanced Distributed Machine Learning with Apache Spark
University of California, Berkeley via edX