An Information-Based Cross-Language Information Retrieval Model

Bo Li 1 Éric Gaussier 2, *
* Corresponding author
1 AMA, MRIM
LIG - Laboratoire d'Informatique de Grenoble
2 AMA
LIG - Laboratoire d'Informatique de Grenoble
Abstract : In this paper, we present well-founded cross-language extensions of two recently introduced models from the information-based family for information retrieval, namely the LL (log-logistic) and SPL (smoothed power law) models of [4]. These extensions rely on (a) a generalization of the notion of information used in the information-based family, (b) a generalization of the random variables used in this family, and (c) the direct expansion of query terms with their translations. We first review these extensions from a theoretical point of view and then assess them experimentally. Experimental comparisons between these extensions and existing cross-language information retrieval (CLIR) systems, on three collections and three language pairs, show that the cross-language extension of the LL model provides a state-of-the-art CLIR system and yields the best performance overall.
Document type :
Conference papers
Ricardo Baeza-Yates, Arjen P. de Vries, Hugo Zaragoza, B. Barla Cambazoglu, Vanessa Murdock, Ronny Lempel and Fabrizio Silvestri. 34th European Conference on IR Research, ECIR 2012, Apr 2012, Barcelona, Spain. Springer, 7224, pp.281-292, 2012, Lecture Notes in Computer Science (LNCS). <10.1007/978-3-642-28997-2_24>


https://hal.archives-ouvertes.fr/hal-00741099
Contributor : Eric Gaussier
Submitted on : Thursday, October 11, 2012 - 4:52:36 PM
Last modification on : Tuesday, October 28, 2014 - 6:35:13 PM

File

Li-ecir2012.pdf

Citation

Bo Li, Éric Gaussier. An Information-Based Cross-Language Information Retrieval Model. Ricardo Baeza-Yates, Arjen P. de Vries, Hugo Zaragoza, B. Barla Cambazoglu, Vanessa Murdock, Ronny Lempel and Fabrizio Silvestri. 34th European Conference on IR Research, ECIR 2012, Apr 2012, Barcelona, Spain. Springer, 7224, pp.281-292, 2012, Lecture Notes in Computer Science (LNCS). <10.1007/978-3-642-28997-2_24>. <hal-00741099>

Metrics

Record views : 106
Document downloads : 32