Extending wordnets by learning from multiple resources

Abstract: In this paper we present an automatic, language-independent approach to extending an existing wordnet by reusing freely available bilingual resources, such as machine-readable dictionaries and on-line encyclopaedias. The approach is applied to Slovene and French. The words extracted from the bilingual resources are assigned one or several synset ids by a classifier that relies on several features, including distributional similarity. Automatic and manual evaluations show that the resulting extensions of sloWNet and WOLF are lexico-semantic repositories of high coverage as well as high quality.
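
To make the role of distributional similarity concrete, here is a minimal sketch that is not taken from the paper. It assumes each candidate word and each synset is represented by a sparse vector of context-word counts, and it ranks synsets by cosine similarity alone. The names `cosine`, `rank_synsets`, `synset-A` and `synset-B`, as well as the toy counts, are hypothetical. The classifier described in the abstract combines several features, of which this would be only one.

```python
import math
from collections import Counter


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse context-count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_synsets(word_context: Counter, synset_contexts: dict) -> list:
    """Rank candidate synset ids by similarity to the word's context vector.

    Illustrative stand-in for one feature only; the paper's classifier
    uses several features that are not reproduced here.
    """
    scores = {sid: cosine(word_context, ctx) for sid, ctx in synset_contexts.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy, made-up context counts for one candidate word and two hypothetical synsets.
word_context = Counter({"river": 3, "water": 2, "money": 1})
synset_contexts = {
    "synset-A": Counter({"river": 4, "water": 3, "shore": 1}),
    "synset-B": Counter({"money": 5, "loan": 2, "account": 2}),
}

ranked = rank_synsets(word_context, synset_contexts)
for synset_id, score in ranked:
    print(f"{synset_id}\t{score:.3f}")
```

In a setting like the one the abstract describes, a score of this kind would presumably be one input among others to the classifier rather than a decision rule by itself.
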
Document type:
Conference paper
LTC'11 : 5th Language and Technology Conference, Nov 2011, Poznań, Poland. 2011, Human Language Technologies as a Challenge for Computer Science and Linguistics

https://hal.inria.fr/hal-00655785
Contributor: Benoît Sagot
Submitted on: Monday, January 2, 2012 - 14:21:27
Last modified on: Saturday, June 9, 2018 - 10:30:06
Document(s) archived on: Tuesday, April 3, 2012 - 02:26:02

File

ltc-87-sagot.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00655785, version 1

Citation

Benoît Sagot, Darja Fišer. Extending wordnets by learning from multiple resources. LTC'11 : 5th Language and Technology Conference, Nov 2011, Poznań, Poland. 2011, Human Language Technologies as a Challenge for Computer Science and Linguistics. 〈hal-00655785〉

Metrics

Record views

360

File downloads

186