GRISP: A Massive Multilingual Terminological Database for Scientific and Technical Domains

Abstract : The development of a multilingual terminology is a very long and costly process. We present the creation of a multilingual terminological database called GRISP covering multiple technical and scientific fields from various open resources. A crucial aspect is the merging of the different resources which is based in our proposal on the definition of a sound conceptual model, different domain mapping and the use of structural constraints and machine learning techniques for controlling the fusion process. The result is a massive terminological database of several millions terms, concepts, semantic relations and definitions. This resource has allowed us to improve significantly the mean average precision of an information retrieval system applied to a large collection of multilingual and multidomain patent documents.
Type de document :
Communication dans un congrès
LREC 2010, May 2010, La Valette, Malta. 2010
Liste complète des métadonnées

Littérature citée [21 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00490312
Contributeur : Laurent Romary <>
Soumis le : mardi 8 juin 2010 - 11:34:28
Dernière modification le : vendredi 3 novembre 2017 - 08:24:20
Document(s) archivé(s) le : vendredi 17 septembre 2010 - 13:03:26

Fichier

LREC-PLLR-2010.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00490312, version 1

Collections

Citation

Patrice Lopez, Laurent Romary. GRISP: A Massive Multilingual Terminological Database for Scientific and Technical Domains. LREC 2010, May 2010, La Valette, Malta. 2010. 〈inria-00490312〉

Partager

Métriques

Consultations de la notice

805

Téléchargements de fichiers

262