hal-00741099, version 1
An Information-Based Cross-Language Information Retrieval Model
34th European Conference on IR Research, ECIR 2012 7224 (2012) 281-292
Résumé : We present in this paper well-founded cross-language extensions of the recently introduced models in the information-based family for information retrieval, namely the LL (log-logistic) and SPL (smoothed power law) models of [4]. These extensions are based on (a) a generalization of the notion of information used in the information-based family, (b) a generalization of the random variables also used in this family, and (c) the direct expansion of query terms with their translations. We then review these extensions from a theoretical point-of-view, prior to assessing them experimentally. The results of the experimental comparisons between these extensions and existing CLIR systems, on three collections and three language pairs, reveal that the cross-language extension of the LL model provides a state-of-the-art CLIR system, yielding the best performance overall.
- 1 : Laboratoire d'Informatique de Grenoble (LIG)
- Université Joseph Fourier - Grenoble I – Institut Polytechnique de Grenoble - Grenoble Institute of Technology – Université Pierre-Mendès-France - Grenoble II – CNRS : UMR5217
- Domaine : Informatique/Apprentissage
Informatique/Intelligence artificielle
Informatique/Recherche d'information
- hal-00741099, version 1
- http://hal.archives-ouvertes.fr/hal-00741099
- oai:hal.archives-ouvertes.fr:hal-00741099
- Contributeur : Eric Gaussier
- Soumis le : Jeudi 11 Octobre 2012, 16:52:36
- Dernière modification le : Mardi 8 Janvier 2013, 14:29:42







Documents associés
Exporter