HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Conference papers

Web as Huge Information Source for Noun Phrases Integration in the Information Retrieval Process

Abstract : Web is a rich and diversified source of information. In this article, we propose to benefit from this richness to collect and analyze documents, with the aim of a relational indexation based on noun phrases. Proposed data processing chain includes a spider collecting data to build textual corpora, and a linguistic module analyzing text to extract information. Comparison of obtained corpus with corpus from Amaryllis conference shows the linguistic diversity of collected corpora, and particularly the richness of extracted noun phrases.
Complete list of metadata

Cited literature [2 references]  Display  Hide  Download

https://hal.inria.fr/inria-00326405
Contributor : Dominique Vaufreydaz Connect in order to contact the contributor
Submitted on : Thursday, October 2, 2008 - 9:44:28 PM
Last modification on : Thursday, February 24, 2022 - 10:06:16 AM
Long-term archiving on: : Friday, June 4, 2010 - 12:09:15 PM

File

Gery02a.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00326405, version 1

Collections

Citation

Mathias Géry, Mohamed Hatem Haddad, Dominique Vaufreydaz. Web as Huge Information Source for Noun Phrases Integration in the Information Retrieval Process. International Conference on Information and Knowledge Engineering (IKE'02), Jun 2002, Las Vegas - Nevada, United States. pp. 72-77. ⟨inria-00326405⟩

Share

Metrics

Record views

146

Files downloads

117