Skip to Main content Skip to Navigation
Conference papers

Building Thesaurus from Manual Sources and Automatic Scanned Texts

Abstract : This paper describes the work done in the TIPS project about the construction of a thesaurus base. This construction is a merge from a thesaurus manually built and one automatically extracted from large text corpora. Several manually built thesaurus have been semi-formatted to be merged in a consistent common base. The automatic extraction is based on both syntax and statistics. We present in this paper the way thesaurus are built and the results on Scientific corpus in the context of the TIPS project.
Document type :
Conference papers
Complete list of metadata

https://hal.inria.fr/hal-00954141
Contributor : Marie-Christine Fauvet <>
Submitted on : Friday, February 28, 2014 - 4:14:22 PM
Last modification on : Tuesday, December 8, 2020 - 10:42:35 AM
Long-term archiving on: : Friday, May 30, 2014 - 3:43:19 PM

File

tips_malaga.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00954141, version 1

Collections

Citation

Jean-Pierre Chevallet. Building Thesaurus from Manual Sources and Automatic Scanned Texts. The 2nd International Conference Adaptive Hypermedia and Adaptive Web Based Systems, 2002, Malaga, Spain. pp.95--104. ⟨hal-00954141⟩

Share

Metrics

Record views

273

Files downloads

66