Technical aspects of Thesaurus Construction in TIPS - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Rapport (Rapport De Recherche) Année : 2002

Technical aspects of Thesaurus Construction in TIPS

Résumé

This paper describes the work done in the TIPS project about the construction of a thesaurus. This construction is a merge from a compilation of data from several web sources. These data comes from manual work, some data are real thesaurus, other are indexing recommendations. The merge is done with automatically extracted terms from large text corpora. The automatic extraction is based on both syntax and statistics. We present in this paper the way thesaurus are built and the results on Scientific corpus in the context of the TIPS project. This short paper emphasis on some technical aspects.
Fichier principal
Vignette du fichier
tips_thesaurus_technical.pdf (240.28 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-00954142 , version 1 (28-02-2014)

Identifiants

  • HAL Id : hal-00954142 , version 1

Citer

Jean-Pierre Chevallet. Technical aspects of Thesaurus Construction in TIPS. [Research Report] 2002. ⟨hal-00954142⟩
123 Consultations
71 Téléchargements

Partager

Gmail Facebook X LinkedIn More