Automatic Extension of WOLF

Abstract : In this paper we present the extension of WOLF, a freely available, automatically creat- ed wordnet for French, the biggest drawback of which has until now been the lack of general concepts that are typically expressed with highly polysemous vocabulary that is on the one hand the most valuable for applications in human language technologies but also the most difficult to add to wordnet accurately with automatic methods on the other. Using a set of features, we train a Maximum Entropy classifier on the existing core wordnet to be able to assign appropriate synset ids to new words, extracted from multiple, multilingual sources of lexical knowledge, such as Wik- tionaries, Wikipedias and corpora. Automatic and manual evaluation shows high coverage as well as high quality of the resulting lexico-semantic repository of. Another important ad- vantage of the approach is that it is fully au- tomatic and language-independent and could therefore be applied to any other language still lacking a wordnet.
Document type :
Conference papers
Complete list of metadatas

Cited literature [17 references]  Display  Hide  Download

https://hal.inria.fr/hal-00655774
Contributor : Benoît Sagot <>
Submitted on : Monday, January 2, 2012 - 12:34:05 PM
Last modification on : Thursday, April 4, 2019 - 1:26:43 AM
Long-term archiving on : Tuesday, April 3, 2012 - 2:25:33 AM

File

gwa2012_61.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00655774, version 1

Collections

Citation

Benoît Sagot, Darja Fišer. Automatic Extension of WOLF. GWC2012 - 6th International Global Wordnet Conference, Global Wordnet Association + Toyohashi University of Technology + National Institute of Japanese Language and Linguistics, Jan 2012, Matsue, Japan. ⟨hal-00655774⟩

Share

Metrics

Record views

541

Files downloads

487