Skip to Main content Skip to Navigation
Poster communications

Étiquetage morpho-syntaxique pour des mots nouveaux

Abstract : Part-of-speech (POS) taggers are more or less robust with respect to the labeling of unknown words not found in the training corpus. It is important to know precisely how these tools perfom when we target part-of-speech tagging for formal neologisms. Indeed, grammatical category is an important criterion for both their identification and documentation. We present an evaluation and comparison of 7 POS taggers for French, based on a corpus built from Wiktionary. The results show that the use of form-related or morphological features supports the accurate tagging of new words.
Document type :
Poster communications
Complete list of metadata
Contributor : Ingrid Falk Connect in order to contact the contributor
Submitted on : Friday, July 18, 2014 - 10:45:52 PM
Last modification on : Wednesday, October 21, 2020 - 9:12:03 AM
Long-term archiving on: : Thursday, November 20, 2014 - 4:08:21 PM



  • HAL Id : hal-00998866, version 1



Ingrid Falk, Delphine Bernhard, Christophe Gérard, Romain Potier-Ferry. Étiquetage morpho-syntaxique pour des mots nouveaux. Brigitte Bigi. 21ème conférence sur le Traitement Automatique des Langues Naturelles, Jul 2014, Marseille, France. 21ème Traitement Automatique des Langues Naturelles, pp.431, 2014. ⟨hal-00998866⟩



Les métriques sont temporairement indisponibles