Crowdsourcing Complex Language Resources: Playing to Annotate Dependency Syntax

Bruno Guillaume 1 Karën Fort 2 Nicolas Lefèbvre 1
1 SEMAGRAMME - Semantic Analysis of Natural Language
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This article presents the results we obtained on a complex annotation task (that of dependency syntax) using a specifically designed Game with a Purpose, ZombiLingo. We show that with suitable mechanisms (decomposition of the task, training of the players and regular control of the annotation quality during the game), it is possible to obtain annotations whose quality is significantly higher than that obtainable with a parser, provided that enough players participate. The source code of the game and the resulting annotated corpora (for French) are freely available.
Type de document :
Communication dans un congrès
International Conference on Computational Linguistics (COLING), Dec 2016, Osaka, Japan. 2016, Proceedings of the 26th International Conference on Computational Linguistics (COLING). <http://coling2016.anlp.jp/>
Liste complète des métadonnées

https://hal.inria.fr/hal-01378980
Contributeur : Karën Fort <>
Soumis le : mardi 11 octobre 2016 - 09:25:24
Dernière modification le : jeudi 13 octobre 2016 - 01:06:04
Document(s) archivé(s) le : vendredi 13 janvier 2017 - 01:30:20

Fichier

coling2016_zl.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01378980, version 1

Collections

Citation

Bruno Guillaume, Karën Fort, Nicolas Lefèbvre. Crowdsourcing Complex Language Resources: Playing to Annotate Dependency Syntax. International Conference on Computational Linguistics (COLING), Dec 2016, Osaka, Japan. 2016, Proceedings of the 26th International Conference on Computational Linguistics (COLING). <http://coling2016.anlp.jp/>. <hal-01378980>

Partager

Métriques

Consultations de
la notice

192

Téléchargements du document

464