Exploiting Locality of Wikipedia Links in Entity Ranking

Abstract : Information retrieval from web and XML document collections is ever more focused on returning entities instead of web pages or XML elements. There are many research fields involving named entities; one such field is known as entity ranking, where one goal is to rank entities in response to a query supported with a short list of entity examples. In this paper, we describe our approach to ranking entities from the Wikipedia XML document collection. Our approach utilises the known categories and the link structure of Wikipedia, and more importantly, exploits link co-occurrences to improve the effectiveness of entity ranking. Using the broad context of a full Wikipedia page as a baseline, we evaluate two different algorithms for identifying narrow contexts around the entity examples: one that uses predefined types of elements such as paragraphs, lists and tables; and another that dynamically identifies the contexts by utilising the underlying XML document structure. Our experiments demonstrate that the locality of Wikipedia links can be exploited to significantly improve the effectiveness of entity ranking.
Type de document :
Communication dans un congrès
The 30th annual European Conference on Information Retrieval (ECIR), Apr 2008, Glasgow, Scotland, United Kingdom. Springer, 2008, LNCS
Liste complète des métadonnées

Littérature citée [14 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00232794
Contributeur : Anne-Marie Vercoustre <>
Soumis le : vendredi 1 février 2008 - 17:33:16
Dernière modification le : vendredi 25 mai 2018 - 12:02:04
Document(s) archivé(s) le : lundi 3 mai 2010 - 16:09:52

Fichier

ecir08Final.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00232794, version 1

Collections

Citation

Jovan Pehcevski, Anne-Marie Vercoustre, James Thom. Exploiting Locality of Wikipedia Links in Entity Ranking. The 30th annual European Conference on Information Retrieval (ECIR), Apr 2008, Glasgow, Scotland, United Kingdom. Springer, 2008, LNCS. 〈inria-00232794〉

Partager

Métriques

Consultations de la notice

235

Téléchargements de fichiers

265