Skip to Main content Skip to Navigation
Conference papers

Le système WoDiS - WOlf & DIStributions pour la substitution lexicale

Kata Gábor 1
1 ALPAGE - Analyse Linguistique Profonde à Grande Echelle ; Large-scale deep linguistic processing
Inria Paris-Rocquencourt, UPD7 - Université Paris Diderot - Paris 7
Abstract : In this paper we describe the WoDiS system, as entered in the SemDis-TALN2014 lexical substitution shared task. Substitution candidates are generated from the WOLF (WordNet Libre du Français) and are clustered according to the structure of the synsets containing them to reflect the different senses of the target word. These senses are represented in a vector space specific to the target word, based on distributional data extracted from a corpus. This vector space is then mapped to the context with simple topical similarity metrics used in document classification. To overcome the data sparseness problem while representing the less frequent senses, we apply a lexical expansion method which allows to extract a higher number of relevant contexts and to compensate for the bias present in corpus-based distributional vectors. Our system ranked fourth in the final evaluation.
Complete list of metadata

Cited literature [28 references]  Display  Hide  Download
Contributor : Kata Gábor Connect in order to contact the contributor
Submitted on : Thursday, July 10, 2014 - 1:16:16 PM
Last modification on : Friday, January 21, 2022 - 3:21:43 AM
Long-term archiving on: : Friday, October 10, 2014 - 11:41:52 AM


Files produced by the author(s)


  • HAL Id : hal-01022406, version 1



Kata Gábor. Le système WoDiS - WOlf & DIStributions pour la substitution lexicale. Sémantique Distributionnelle - Atelier TALN 2014, Jul 2014, Marseille, France. ⟨hal-01022406⟩



Les métriques sont temporairement indisponibles