Skip to Main content Skip to Navigation
Conference papers

Cheating a Parser to Death: Data-driven Cross-Treebank Annotation Transfer

Abstract : We present an efficient and accurate method for transferring annotations between two different treebanks of the same language. This method led to the creation of a new instance of the French Treebank (Abeillé et al., 2003), which follows the Universal Dependency annotation scheme and which was proposed to the participants of the CoNLL 2017 Universal Dependency parsing shared task (Zeman et al., 2017). Strong results from an evaluation on our gold standard (94.75% of LAS, 99.40% UAS on the test set) demonstrate the quality of this new annotated data set and validate our approach.
Document type :
Conference papers
Complete list of metadata

Cited literature [22 references]  Display  Hide  Download
Contributor : Benoît Sagot <>
Submitted on : Wednesday, May 23, 2018 - 11:13:29 PM
Last modification on : Monday, December 28, 2020 - 4:54:02 PM
Long-term archiving on: : Friday, August 24, 2018 - 11:33:26 PM


Files produced by the author(s)


  • HAL Id : hal-01798801, version 1


Djamé Seddah, Éric Villemonte de la Clergerie, Benoît Sagot, Hector Martinez Alonso, Marie Candito. Cheating a Parser to Death: Data-driven Cross-Treebank Annotation Transfer. Eleventh International Conference on Language Resources and Evaluation (LREC 2018), May 2018, Miyazaki, Japan. ⟨hal-01798801⟩



Record views


Files downloads