Conference paper, Year: 2014

Exploiting Wikipedia Structure for Short Query Expansion in Cultural Heritage

Abstract

This paper addresses the problem of short, precise queries, which do not carry enough information to be unambiguous. Pseudo-relevance feedback (PRF) is an effective technique for improving retrieval performance by expanding a user query. However, this collection-based expansion method does not work well for short queries. Instead of PRF, we therefore present a semantic query expansion method that uses Wikipedia as external knowledge. We expand short queries with semantically related terms extracted from Wikipedia. We propose three variations for selecting expansion terms and study their effectiveness. We incorporate the expansion terms into the original query and adapt language models to evaluate the expanded queries. Experiments on CLEF cultural heritage corpora show a significant improvement in retrieval performance. We also show that the number of expansion terms has an important impact on the improvement in precision.
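As a rough illustration of the pipeline the abstract outlines (select related terms, add them to the query, score documents with a language model), here is a minimal Python sketch. It is not the authors' method: the `related` dictionary stands in for relatedness scores that would be derived from Wikipedia, the three toy documents are invented, and the function names, the Dirichlet smoothing parameter `mu`, and the interpolation weight `alpha` are all assumptions made for this example.

```python
import math
from collections import Counter

# Hypothetical relatedness scores; in practice these would come from Wikipedia.
related = {
    "mona": {"leonardo": 0.9, "portrait": 0.8, "louvre": 0.6},
    "lisa": {"portrait": 0.7, "painting": 0.5},
}

# Toy collection, invented for illustration only.
docs = {
    "d1": "the mona lisa is a portrait painted by leonardo da vinci".split(),
    "d2": "lisa bought a new bicycle in paris".split(),
    "d3": "leonardo painted the portrait in florence".split(),
}

coll_tf = Counter(tok for tokens in docs.values() for tok in tokens)
coll_len = sum(coll_tf.values())


def expand_query(query_terms, related, k=2):
    """Pick the k candidate terms with the highest relatedness to any query term."""
    best = {}
    for q in query_terms:
        for term, score in related.get(q, {}).items():
            if term not in query_terms:
                best[term] = max(best.get(term, 0.0), score)
    ranked = sorted(best.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:k]]


def log_prob(term, doc_tf, doc_len, mu=10.0):
    """Dirichlet-smoothed log P(term | document)."""
    # Add-one smoothing on the collection model avoids log(0) for unseen terms.
    p_coll = (coll_tf.get(term, 0) + 1) / (coll_len + len(coll_tf))
    return math.log((doc_tf.get(term, 0) + mu * p_coll) / (doc_len + mu))


def score(doc_tokens, query_terms, expansion_terms, alpha=0.5):
    """Interpolate the mean log-likelihood of original and expansion terms."""
    doc_tf = Counter(doc_tokens)
    doc_len = len(doc_tokens)

    def mean_ll(terms):
        if not terms:
            return 0.0
        return sum(log_prob(t, doc_tf, doc_len) for t in terms) / len(terms)

    return alpha * mean_ll(query_terms) + (1 - alpha) * mean_ll(expansion_terms)


def rank(query_terms, expansion_terms):
    """Return document ids sorted from best to worst score."""
    return sorted(
        docs,
        key=lambda d: score(docs[d], query_terms, expansion_terms),
        reverse=True,
    )


query = ["mona", "lisa"]
expansion = expand_query(query, related, k=2)
print("expansion terms:", expansion)
print("without expansion:", rank(query, []))
print("with expansion:", rank(query, expansion))
```

Changing `k` in `expand_query` is a simple way to explore how the number of expansion terms affects the ranking on this toy data, which is the kind of sensitivity the abstract reports for precision.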
No file deposited

Dates and versions

hal-00953137 , version 1 (28-02-2014)

Identifiers

  • HAL Id : hal-00953137 , version 1

Cite

Mohannad Almasri, Jean-Pierre Chevallet, Catherine Berrut. Exploiting Wikipedia Structure for Short Query Expansion in Cultural Heritage. CORIA, 2014, Nancy, France. ⟨hal-00953137⟩