Performance Improvements for Search Systems using an Integrated Cache of Lists+Intersections

Abstract : Modern information retrieval systems use sophisticated techniques to achieve efficiency and scalability. Among the most common of these is caching at several levels. The main goal of a cache is to speed up computation by exploiting frequent, recent, or costly data used in the past. In this study we propose and evaluate a static cache that works simultaneously as a list cache and an intersection cache, offering a more efficient way of handling cache space. In addition, we propose effective strategies to select the term pairs that should populate the cache. Simulations using two datasets and a real query log reveal that the proposed approach improves overall performance in terms of total processing time, achieving savings of up to 40% in the best case.
Document type :
Journal articles

Cited literature [24 references]

https://hal.inria.fr/hal-01528536
Contributor : Marie-France Sagot
Submitted on : Monday, May 29, 2017 - 12:17:40 PM
Last modification on : Friday, April 19, 2019 - 1:37:49 AM
Long-term archiving on : Wednesday, September 6, 2017 - 10:37:29 AM

File

alberto2017_1.pdf
Files produced by the author(s)

Citation

Gabriel Tolosa, Luca Becchetti, Esteban Feuerstein, Alberto Marchetti-Spaccamela. Performance Improvements for Search Systems using an Integrated Cache of Lists+Intersections. Information Retrieval Journal, Springer, 2017, 20 (3), pp.172-198. ⟨10.1007/978-3-319-11918-2_22⟩. ⟨hal-01528536⟩
