RMIT INEX experiments: XML Retrieval using Lucy and eXist - Archive ouverte HAL Access content directly
Conference Papers Year : 2003

RMIT INEX experiments: XML Retrieval using Lucy and eXist


This paper reports on the RMIT group's approach to XML retrieval while participating in INEX 2003. We indexed XML documents using Lucy, a compact and fast text search engine designed and written by the Search Engine Group at RMIT University. For each INEX topic, up to 1000 highly ranked documents were then loaded and indexed by eXist, an open source native XML database. A query translator converts the INEX topics into corresponding Lucy and eXist query expressions respectively. These query expressions may represent traditional information retrieval tasks(unconstrained, CO topics), or may focus on retrieving and ranking specific document components (constrained, CAS topics). With respect to both these expressions types, we used eXist to extract final answers (either full documents or document components) frome those documents that were judged highly relevant by Luy. Several extraction strategies were used that diffeently influenced the ranking order of the final answers. The final INEX results show that our choice for a translation method and an extraction stategy leads to a very effective XML retrieval for the CAS topics. We observed a system limitation for CO topics resulting in the same or similar choice to have little or no impact on the retrieval performance.
Fichier principal
Vignette du fichier
RMIT-INEX03-final.pdf (215.05 Ko) Télécharger le fichier

Dates and versions

inria-00090569 , version 1 (01-09-2006)


  • HAL Id : inria-00090569 , version 1


Jovan Pehcevski, James A. Thom, Anne-Marie Vercoustre. RMIT INEX experiments: XML Retrieval using Lucy and eXist. 2nd Workshop of the Initiative for the Evaluation of XML Retrieval (INEX'03), Dec 2003, Schloss Dagstuhl, Germany. ⟨inria-00090569⟩
39 View
93 Download


Gmail Facebook Twitter LinkedIn More