Skip to Main content Skip to Navigation
Conference papers

Building Thesaurus from Manual Sources and Automatic Scanned Texts

Abstract : This paper describes the work done in the TIPS project about the construction of a thesaurus base. This construction is a merge from a thesaurus manually built and one automatically extracted from large text corpora. Several manually built thesaurus have been semi-formatted to be merged in a consistent common base. The automatic extraction is based on both syntax and statistics. We present in this paper the way thesaurus are built and the results on Scientific corpus in the context of the TIPS project.
Document type :
Conference papers
Complete list of metadata
Contributor : Marie-Christine Fauvet Connect in order to contact the contributor
Submitted on : Friday, February 28, 2014 - 4:14:22 PM
Last modification on : Sunday, June 26, 2022 - 4:59:26 AM
Long-term archiving on: : Friday, May 30, 2014 - 3:43:19 PM


Files produced by the author(s)


  • HAL Id : hal-00954141, version 1



Jean-Pierre Chevallet. Building Thesaurus from Manual Sources and Automatic Scanned Texts. The 2nd International Conference Adaptive Hypermedia and Adaptive Web Based Systems, 2002, Malaga, Spain. pp.95--104. ⟨hal-00954141⟩



Record views


Files downloads