Conversion et améliorations de corpus du français annotés en Universal Dependencies

Bruno Guillaume 1 Marie-Catherine de Marneffe 2 Guy Perrier 1
1 SEMAGRAMME - Semantic Analysis of Natural Language
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This paper describes an effort to improve the consistency of two French corpora annotated with the Universal Dependencies (UD) scheme. The Universal Dependencies project aims to build a syntactic dependency scheme that allows similar analyses across different languages. We improved the annotations of the two French corpora to bring them closer to the UD scheme, and evaluated the changes made to the corpora in terms of both closeness to the UD scheme and internal corpus consistency.
Document type :
Journal articles

https://hal.inria.fr/hal-02267418
Contributor : Bruno Guillaume
Submitted on : Monday, August 19, 2019 - 11:40:34 AM
Last modification on : Friday, January 10, 2020 - 1:55:12 PM
Long-term archiving on: Thursday, January 9, 2020 - 11:14:27 PM

File

UD_French.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02267418, version 1

Citation

Bruno Guillaume, Marie-Catherine de Marneffe, Guy Perrier. Conversion et améliorations de corpus du français annotés en Universal Dependencies. Traitement Automatique des Langues, ATALA, 2019, 60 (2), pp.71-95. ⟨hal-02267418⟩
