Building a free French wordnet from multilingual resources

Abstract : This paper describes automatic construction a freely-available wordnet for French (WOLF) based on Princeton WordNet (PWN) by using various multilingual resources. Polysemous words were dealt with an approach in which a parallel corpus for five languages was word-aligned and the extracted multilingual lexicon was disambiguated with the existing wordnets for these languages. On the other hand, a bilingual approach sufficed to acquire equivalents for monosemous words. Bilingual lexicons were extracted from Wikipedia and thesauri. The results obtained from each resource were merged and ranked according to the number of resources yielding the same literal. Automatic evaluation of the merged wordnet was performed with the French WordNet (FREWN). Manual evaluation was also carried out on a sample of the generated synsets. Precision shows that the presented approach has proved to be very promising and applications to use the created wordnet are already intended.
Type de document :
Communication dans un congrès
OntoLex, May 2008, Marrakech, Morocco. 2008
Liste complète des métadonnées

Littérature citée [18 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00614708
Contributeur : Benoît Sagot <>
Soumis le : lundi 15 août 2011 - 11:34:04
Dernière modification le : mardi 11 octobre 2016 - 14:41:45
Document(s) archivé(s) le : vendredi 25 novembre 2011 - 11:11:56

Fichier

Ontolex08.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00614708, version 1

Collections

Citation

Benoît Sagot, Darja Fišer. Building a free French wordnet from multilingual resources. OntoLex, May 2008, Marrakech, Morocco. 2008. 〈inria-00614708〉

Partager

Métriques

Consultations de la notice

388

Téléchargements de fichiers

407