
Exploiting Wikipedia Structure for Short Query Expansion in Cultural Heritage

Abstract: This paper addresses the problem of short, precise queries, which do not carry enough information to be unambiguous. Pseudo-relevance feedback (PRF) is an effective technique for improving retrieval performance by expanding a user query. However, this collection-based expansion method does not work well for short queries. Instead of PRF, we therefore present a semantic query expansion method that uses Wikipedia as an external knowledge source. We expand short queries with semantically related terms extracted from Wikipedia. We propose three variations of expansion-term selection and study their effectiveness. We incorporate the expansion terms into the original query and adapt language models to evaluate the expanded queries. Experiments on CLEF cultural heritage corpora show a significant improvement in retrieval performance. We also show that the number of expansion terms has an important impact on the precision improvement.
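
As a rough illustration of the general idea only (the abstract does not give the paper's term-selection variations or its exact language-model adaptation), the sketch below appends up to k related terms to a short query and scores documents with a standard Dirichlet-smoothed query-likelihood model. The `related_terms` table is a hypothetical stand-in for terms extracted from Wikipedia; the function names, the toy documents, and the smoothing value `mu` are all assumptions made for this example.

```python
import math
from collections import Counter


def expand_query(query_terms, related_terms, k):
    """Append up to k related terms per original query term.

    `related_terms` is a hypothetical lookup table standing in for
    terms extracted from Wikipedia; it is not the paper's method.
    """
    expanded = list(query_terms)
    for term in query_terms:
        for candidate in related_terms.get(term, [])[:k]:
            if candidate not in expanded:
                expanded.append(candidate)
    return expanded


def query_log_likelihood(query_terms, doc_tokens, collection_counts,
                         collection_len, mu=2000.0):
    """Dirichlet-smoothed query log-likelihood of a document.

    Terms that never occur in the collection are skipped, so they
    contribute nothing to the score (an assumption of this sketch).
    """
    doc_counts = Counter(doc_tokens)
    doc_len = len(doc_tokens)
    score = 0.0
    for term in query_terms:
        p_coll = collection_counts.get(term, 0) / collection_len
        if p_coll == 0.0:
            continue
        p_term = (doc_counts[term] + mu * p_coll) / (doc_len + mu)
        score += math.log(p_term)
    return score


# Toy data, invented for illustration only.
docs = {
    "d1": ["museum", "paris", "painting"],
    "d2": ["football", "match", "paris"],
}
collection_counts = Counter(t for tokens in docs.values() for t in tokens)
collection_len = sum(collection_counts.values())

related = {"louvre": ["museum", "paris", "art"]}
expanded = expand_query(["louvre"], related, k=2)
scores = {
    name: query_log_likelihood(expanded, tokens, collection_counts, collection_len)
    for name, tokens in docs.items()
}
```

In this toy setting the original one-word query matches neither document, while the expanded query lets the two documents be told apart. The parameter k corresponds to the number of expansion terms, which the abstract reports as having an important impact on precision.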
Document type: Conference papers
Contributor: Marie-Christine Fauvet
Submitted on : Friday, February 28, 2014 - 10:52:08 AM
Last modification on : Wednesday, July 6, 2022 - 4:15:06 AM


  • HAL Id : hal-00953137, version 1



Mohannad Almasri, Jean-Pierre Chevallet, Catherine Berrut. Exploiting Wikipedia Structure for Short Query Expansion in Cultural Heritage. CORIA, 2014, Nancy, France. ⟨hal-00953137⟩


