Using Wikipedia Categories and Links in Entity Ranking

Abstract : This paper describes the participation of the INRIA group in the INEX 2007 XML entity ranking and ad hoc tracks. We developed a system for ranking Wikipedia entities in answer to a query. Our approach utilises the known categories, the link structure of Wikipedia, as well as the link co-occurrences with the examples (when provided) to improve the effectiveness of entity ranking. Our experiments on the training data set demonstrate that the use of categories and the link structure of Wikipedia, together with entity examples, can significantly improve entity retrieval effectiveness. We also use our system for the ad hoc tasks by inferring target categories from the title of the query. The results were worse than when using a full-text search engine, which confirms our hypothesis that ad hoc retrieval and entity retrieval are two different tasks.
Document type :
Conference papers
Norbert Fuhr and Jaap Kamps and Mounia Lalmas and Saadia Malik and Andrew Trotman. Proceedings of the sixth International Workshop of the Initiative for the Evaluation of XML Retrieval (INEX 2007), Dec 2007, Schloss Dagstuhl, Germany. Springer, Volume 4862/2008, pp. 321-335, 2008, LNCS. 〈10.1007/978-3-540-85902-4_28〉
Liste complète des métadonnées

Cited literature [12 references]  Display  Hide  Download

https://hal.inria.fr/inria-00192489
Contributor : Anne-Marie Vercoustre <>
Submitted on : Wednesday, November 28, 2007 - 12:20:29 PM
Last modification on : Tuesday, August 26, 2008 - 11:09:04 AM
Document(s) archivé(s) le : Monday, April 12, 2010 - 5:23:05 AM

File

inex07.pdf
Files produced by the author(s)

Identifiers

Collections

Citation

Anne-Marie Vercoustre, Jovan Pehcevski, James Thom. Using Wikipedia Categories and Links in Entity Ranking. Norbert Fuhr and Jaap Kamps and Mounia Lalmas and Saadia Malik and Andrew Trotman. Proceedings of the sixth International Workshop of the Initiative for the Evaluation of XML Retrieval (INEX 2007), Dec 2007, Schloss Dagstuhl, Germany. Springer, Volume 4862/2008, pp. 321-335, 2008, LNCS. 〈10.1007/978-3-540-85902-4_28〉. 〈inria-00192489〉

Share

Metrics

Record views

210

Files downloads

508