Abstract : This paper describes the work done in the TIPS project about the construction of a thesaurus base. This construction is a merge from a thesaurus manually built and one automatically extracted from large text corpora. Several manually built thesaurus have been semi-formatted to be merged in a consistent common base. The automatic extraction is based on both syntax and statistics. We present in this paper the way thesaurus are built and the results on Scientific corpus in the context of the TIPS project.
https://hal.inria.fr/hal-00954141 Contributor : Marie-Christine FauvetConnect in order to contact the contributor Submitted on : Friday, February 28, 2014 - 4:14:22 PM Last modification on : Sunday, June 26, 2022 - 4:59:26 AM Long-term archiving on: : Friday, May 30, 2014 - 3:43:19 PM
Jean-Pierre Chevallet. Building Thesaurus from Manual Sources and Automatic Scanned Texts. The 2nd International Conference Adaptive Hypermedia and Adaptive Web Based Systems, 2002, Malaga, Spain. pp.95--104. ⟨hal-00954141⟩