Automatic Extension of WOLF

Abstract : In this paper we present the extension of WOLF, a freely available, automatically creat- ed wordnet for French, the biggest drawback of which has until now been the lack of general concepts that are typically expressed with highly polysemous vocabulary that is on the one hand the most valuable for applications in human language technologies but also the most difficult to add to wordnet accurately with automatic methods on the other. Using a set of features, we train a Maximum Entropy classifier on the existing core wordnet to be able to assign appropriate synset ids to new words, extracted from multiple, multilingual sources of lexical knowledge, such as Wik- tionaries, Wikipedias and corpora. Automatic and manual evaluation shows high coverage as well as high quality of the resulting lexico-semantic repository of. Another important ad- vantage of the approach is that it is fully au- tomatic and language-independent and could therefore be applied to any other language still lacking a wordnet.
Type de document :
Communication dans un congrès
GWC2012 - 6th International Global Wordnet Conference, Jan 2012, Matsue, Japan. 2012
Liste complète des métadonnées

Littérature citée [17 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-00655774
Contributeur : Benoît Sagot <>
Soumis le : lundi 2 janvier 2012 - 12:34:05
Dernière modification le : mardi 17 avril 2018 - 11:31:35
Document(s) archivé(s) le : mardi 3 avril 2012 - 02:25:33

Fichier

gwa2012_61.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00655774, version 1

Collections

Citation

Benoît Sagot, Darja Fišer. Automatic Extension of WOLF. GWC2012 - 6th International Global Wordnet Conference, Jan 2012, Matsue, Japan. 2012. 〈hal-00655774〉

Partager

Métriques

Consultations de la notice

278

Téléchargements de fichiers

386