Conference papers

Reading Contexts for Structured Documents Retrieval

Abstract : This paper focuses on the retrieval of parts of structured documents, called doxels. We propose the notion of the reading context of a doxel and exploit it to extend an indexing Language Model (LM) with Dirichlet smoothing. We interpret the context of a doxel as a propagation of the content of connected doxels via document structure links. We evaluate this model on the INEX 2009 corpus and test different context propagations. Compared to a reference approach that does not use context, we measure a significant improvement in results when using contexts for 3 types of doxels. Moreover, our proposal outperforms the best result obtained in the Focused evaluation of the Ad Hoc task at INEX 2009.
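
The abstract names two ingredients: a language model with Dirichlet smoothing, and propagation of content from connected doxels. The sketch below illustrates how these could fit together; it is not the authors' formulation. The Dirichlet estimate (tf + mu * P(t|C)) / (|d| + mu) is the standard one. The `merge_context` function, its `weight` parameter, and the toy counts are assumptions made purely for illustration, since the record does not describe the actual propagation scheme.

```python
import math
from collections import Counter

def dirichlet_prob(tf, length, p_coll, mu=2000.0):
    """Standard Dirichlet-smoothed estimate: (tf + mu * P(t|C)) / (|d| + mu)."""
    return (tf + mu * p_coll) / (length + mu)

def merge_context(doxel_tf, context_tfs, weight=0.5):
    """Hypothetical propagation (an assumption, not the paper's method):
    add down-weighted term counts of structurally linked doxels."""
    merged = Counter(doxel_tf)
    for ctx in context_tfs:
        for term, count in ctx.items():
            merged[term] += weight * count
    return merged

def query_log_likelihood(query_terms, tf, coll_prob, mu=2000.0):
    """Sum of log Dirichlet-smoothed probabilities of the query terms."""
    length = sum(tf.values())
    return sum(
        math.log(dirichlet_prob(tf.get(t, 0), length, coll_prob[t], mu))
        for t in query_terms
    )

# Toy data (invented for illustration only).
coll_prob = {"xml": 0.01, "retrieval": 0.02}    # collection model P(t|C)
section = Counter({"xml": 2})                   # term counts of the doxel itself
parent = Counter({"retrieval": 3, "xml": 1})    # a linked doxel used as context

query = ["xml", "retrieval"]
plain = query_log_likelihood(query, section, coll_prob, mu=10.0)
with_ctx = query_log_likelihood(
    query, merge_context(section, [parent]), coll_prob, mu=10.0
)
print(plain, with_ctx)
```

In this toy case the doxel alone never mentions "retrieval", so its score relies entirely on smoothing for that term; once the linked doxel's counts are propagated, the query likelihood rises. How links are selected and weighted is exactly what the paper's "different context propagations" would vary.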
Document type :
Conference papers

https://hal.inria.fr/hal-00953082
Contributor : Marie-Christine Fauvet
Submitted on : Friday, February 28, 2014 - 10:31:39 AM
Last modification on : Thursday, October 21, 2021 - 3:47:36 AM

Identifiers

  • HAL Id : hal-00953082, version 1

Citation

Philippe Mulhem, Jean-Pierre Chevallet. Reading Contexts for Structured Documents Retrieval. OAIR, 2013, Lisbon, Portugal. pp.47-52. ⟨hal-00953082⟩
