RMIT INEX experiments: XML Retrieval using Lucy and eXist

Abstract : This paper reports on the RMIT group's approach to XML retrieval while participating in INEX 2003. We indexed XML documents using Lucy, a compact and fast text search engine designed and written by the Search Engine Group at RMIT University. For each INEX topic, up to 1000 highly ranked documents were then loaded and indexed by eXist, an open source native XML database. A query translator converts the INEX topics into corresponding Lucy and eXist query expressions respectively. These query expressions may represent traditional information retrieval tasks(unconstrained, CO topics), or may focus on retrieving and ranking specific document components (constrained, CAS topics). With respect to both these expressions types, we used eXist to extract final answers (either full documents or document components) frome those documents that were judged highly relevant by Luy. Several extraction strategies were used that diffeently influenced the ranking order of the final answers. The final INEX results show that our choice for a translation method and an extraction stategy leads to a very effective XML retrieval for the CAS topics. We observed a system limitation for CO topics resulting in the same or similar choice to have little or no impact on the retrieval performance.
Type de document :
Communication dans un congrès
2nd Workshop of the Initiative for the Evaluation of XML Retrieval (INEX'03), Dec 2003, Schloss Dagstuhl, Germany, 2003
Liste complète des métadonnées

Littérature citée [8 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00090569
Contributeur : Anne-Marie Vercoustre <>
Soumis le : vendredi 1 septembre 2006 - 16:40:50
Dernière modification le : jeudi 11 janvier 2018 - 17:22:01
Document(s) archivé(s) le : mardi 6 avril 2010 - 00:43:25

Identifiants

  • HAL Id : inria-00090569, version 1

Citation

Jovan Pehcevski, James Thom, Anne-Marie Vercoustre. RMIT INEX experiments: XML Retrieval using Lucy and eXist. 2nd Workshop of the Initiative for the Evaluation of XML Retrieval (INEX'03), Dec 2003, Schloss Dagstuhl, Germany, 2003. 〈inria-00090569〉

Partager

Métriques

Consultations de la notice

78

Téléchargements de fichiers

76