3532 articles – 5253 Notices  [english version]

inria-00418458, version 1

Grouping Synonyms by Definitions

Ingrid Falk () 1, Claire Gardent () 1, Evelyne Jacquey () 2, Fabienne Venant () a1

Recent Advances in Natural Language Processing (RANLP) (2009) 6

Résumé : We present a method for grouping the synonyms of a lemma according to its dictionary senses. The senses are defined by a large machine readable dictionary for French, the TLFi (Trésor de la langue française informatisé) and the synonyms are given by 5 synonym dictionaries (also for French). To evaluate the proposed method, we manually constructed a gold standard where for each (word, definition) pair and given the set of synonyms defined for that word by the 5 synonym dictionaries, 4 lexicographers specified the set of synonyms they judge adequate. While inter-annotator agreement ranges on that task from 67% to at best 88% depending on the annotator pair and on the synonym dictionary being considered, the automatic procedure we propose scores a precision of 67% and a recall of 71%. The proposed method is compared with related work namely, word sense disambiguation, synonym lexicon acquisition and WordNet construction.

  • a –  Université Nancy II
  • 1 :  TALARIS (INRIA Nancy - Grand Est / LORIA)
  • CNRS : UMR7503 – INRIA – Université Henri Poincaré - Nancy I – Université Nancy II – Institut National Polytechnique de Lorraine (INPL)
  • 2 :  Analyse et Traitement Informatique de la Langue Française (ATILF)
  • CNRS : UMR7118 – Université Henri Poincaré - Nancy I – Université Nancy II
  • Domaine : Sciences cognitives/Informatique
    Sciences cognitives/Linguistique
    Informatique/Informatique et langage
  • Mots-clés : Similarity measures – Synonyms – Lexical Acquisition
 
  • inria-00418458, version 1
  • oai:hal.inria.fr:inria-00418458
  • Contributeur : 
  • Soumis le : Vendredi 18 Septembre 2009, 16:23:05
  • Dernière modification le : Vendredi 18 Septembre 2009, 16:25:37