Conference paper, Year: 2002

Web as Huge Information Source for Noun Phrases Integration in the Information Retrieval Process

Abstract

The Web is a rich and diverse source of information. In this article, we propose to exploit this richness to collect and analyze documents, with the aim of relational indexing based on noun phrases. The proposed data processing chain includes a spider that collects data to build textual corpora and a linguistic module that analyzes the text to extract information. A comparison of the resulting corpus with the corpus from the Amaryllis conference shows the linguistic diversity of the collected corpora, and in particular the richness of the extracted noun phrases.

Main file

Gery02a.pdf (182.87 KB)
Origin: Files produced by the author(s)

Dates and versions

inria-00326405, version 1 (02-10-2008)

Identifiers

  • HAL Id: inria-00326405, version 1

Cite

Mathias Géry, Mohamed Hatem Haddad, Dominique Vaufreydaz. Web as Huge Information Source for Noun Phrases Integration in the Information Retrieval Process. International Conference on Information and Knowledge Engineering (IKE'02), Jun 2002, Las Vegas - Nevada, United States. pp. 72-77. ⟨inria-00326405⟩