Crowdsourcing Complex Language Resources: Playing to Annotate Dependency Syntax

Bruno Guillaume 1 Karën Fort 2 Nicolas Lefèbvre 1
1 SEMAGRAMME - Semantic Analysis of Natural Language
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This article presents the results we obtained on a complex annotation task (that of dependency syntax) using a specifically designed Game with a Purpose, ZombiLingo. We show that with suitable mechanisms (decomposition of the task, training of the players and regular control of the annotation quality during the game), it is possible to obtain annotations whose quality is significantly higher than that obtainable with a parser, provided that enough players participate. The source code of the game and the resulting annotated corpora (for French) are freely available.
Document type :
Conference papers

https://hal.inria.fr/hal-01378980
Contributor : Karën Fort
Submitted on : Tuesday, October 11, 2016 - 9:25:24 AM
Last modification on : Friday, September 6, 2019 - 11:48:09 AM
Long-term archiving on : Friday, January 13, 2017 - 1:30:20 AM

File

coling2016_zl.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01378980, version 1

Citation

Bruno Guillaume, Karën Fort, Nicolas Lefèbvre. Crowdsourcing Complex Language Resources: Playing to Annotate Dependency Syntax. International Conference on Computational Linguistics (COLING), Dec 2016, Osaka, Japan. ⟨hal-01378980⟩
