Automatic construction of a TMF Terminological Database using a transducer cascade

Abstract : The automatic development of terminological databases, especially in a standardized format, is crucial for many applications involving technical and scientific knowledge that require semantic and terminological descriptions covering multiple domains. In this context, we face two challenges: the first is the automatic extraction of terms in order to build a terminological database, and the second is their normalization into a standardized format. To address these challenges, we propose an approach based on a cascade of transducers run with the CasSys tool of the Unitex platform. It benefits from both the success of the rule-based approach for term extraction and the performance of the TMF standard for term representation. We have tested and evaluated our approach on Arabic scientific and technical documents from the elevator domain, and the results are very encouraging.
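
The two steps named in the abstract (cascaded rule-based extraction, then normalization into TMF) can be illustrated with a minimal sketch. This is not the authors' implementation: ordered regular-expression rewrites stand in for the Unitex/CasSys transducers, the English toy lexicon replaces the Arabic corpus for readability, and the XML only loosely follows the TMF metamodel levels (TDC, TE, LS, TS). All function names, rule patterns, and feature names below are assumptions made for illustration.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical cascade: each stage is a (pattern, replacement) rewrite rule
# applied to the output of the previous stage, imitating a transducer cascade.
CASCADE = [
    # Stage 1: mark head nouns from a toy lexicon (an assumption).
    (re.compile(r"\b(cable|door|motor)\b"), r"<N>\1</N>"),
    # Stage 2: merge a known modifier and a marked head into a two-word term.
    (re.compile(r"\b(traction|landing|hoist) <N>(\w+)</N>"), r"<TERM>\1 \2</TERM>"),
    # Stage 3: promote any remaining marked head to a single-word term.
    (re.compile(r"<N>(\w+)</N>"), r"<TERM>\1</TERM>"),
]


def run_cascade(text):
    """Apply every stage in order; later stages see earlier annotations."""
    for pattern, replacement in CASCADE:
        text = pattern.sub(replacement, text)
    return text


def extract_terms(annotated):
    """Collect the contents of the <TERM> annotations left by the cascade."""
    return re.findall(r"<TERM>(.*?)</TERM>", annotated)


def to_tmf(terms, lang="en"):
    """Serialize terms into a TMF-like nesting: TDC > TE > LS > TS."""
    root = ET.Element("struct", type="TDC")
    for i, term in enumerate(terms, start=1):
        entry = ET.SubElement(root, "struct", type="TE")
        ET.SubElement(entry, "feat", type="entryIdentifier").text = f"te{i}"
        lang_sec = ET.SubElement(entry, "struct", type="LS")
        ET.SubElement(lang_sec, "feat", type="languageIdentifier").text = lang
        term_sec = ET.SubElement(lang_sec, "struct", type="TS")
        ET.SubElement(term_sec, "feat", type="term").text = term
    return ET.tostring(root, encoding="unicode")


annotated = run_cascade("The traction motor drives the hoist cable and the door.")
terms = extract_terms(annotated)
print(terms)  # ['traction motor', 'hoist cable', 'door']
print(to_tmf(terms))
```

The order of the stages matters: stage 2 depends on the `<N>` marks written by stage 1, and stage 3 only sees the heads that stage 2 left unmerged. That dependency between passes is the property the cascade design relies on.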
Document type :
Conference papers

https://hal.inria.fr/hal-01276816
Contributor : Laurent Romary
Submitted on : Monday, February 22, 2016 - 12:14:52 PM
Last modification on : Sunday, June 2, 2019 - 10:24:02 AM
Long-term archiving on : Monday, May 23, 2016 - 11:10:47 AM

File

ranlp2015FINAL.pdf
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

  • HAL Id : hal-01276816, version 1

Citation

Chihebeddine Ammar, Kais Haddar, Laurent Romary. Automatic construction of a TMF Terminological Database using a transducer cascade. RANLP-2015, Sep 2015, Hissar, Bulgaria. ⟨hal-01276816⟩
