Skip to Main content Skip to Navigation
Reports

Technical aspects of Thesaurus Construction in TIPS

Abstract : This paper describes the work done in the TIPS project about the construction of a thesaurus. This construction is a merge from a compilation of data from several web sources. These data comes from manual work, some data are real thesaurus, other are indexing recommendations. The merge is done with automatically extracted terms from large text corpora. The automatic extraction is based on both syntax and statistics. We present in this paper the way thesaurus are built and the results on Scientific corpus in the context of the TIPS project. This short paper emphasis on some technical aspects.
Document type :
Reports
Complete list of metadata

https://hal.inria.fr/hal-00954142
Contributor : Marie-Christine Fauvet <>
Submitted on : Friday, February 28, 2014 - 4:14:23 PM
Last modification on : Tuesday, December 8, 2020 - 10:42:35 AM
Long-term archiving on: : Friday, May 30, 2014 - 3:43:28 PM

File

tips_thesaurus_technical.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00954142, version 1

Collections

Citation

Jean-Pierre Chevallet. Technical aspects of Thesaurus Construction in TIPS. [Research Report] 2002. ⟨hal-00954142⟩

Share

Metrics

Record views

240

Files downloads

86