Crowdsourcing for Language Resource Development: Criticisms About Amazon Mechanical Turk Overpowering Use

Abstract : This article is a position paper about Amazon Mechanical Turk, the use of which has been steadily growing in language processing in the past few years. According to the mainstream opinion expressed in articles of the domain, this type of on-line working platforms allows to develop quickly all sorts of quality language resources, at a very low price, by people doing that as a hobby. We shall demonstrate here that the situation is far from being that ideal. Our goal here is manifold: 1- to inform researchers, so that they can make their own choices, 2- to develop alternatives with the help of funding agencies and scientific associations, 3- to propose practical and organizational solutions in order to improve language resources development, while limiting the risks of ethical and legal issues without letting go price or quality, 4- to introduce an Ethics and Big Data Charter for the documentation of language resource
Type de document :
Chapitre d'ouvrage
Vetulani, Zygmunt and Mariani, Joseph. Human Language Technology Challenges for Computer Science and Linguistics, 8387, Springer International Publishing, pp.303-314, 2014, Lecture Notes in Computer Science, 978-3-319-08957-7. 〈10.1007/978-3-319-08958-4_25〉
Liste complète des métadonnées

Littérature citée [41 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01053047
Contributeur : Karën Fort <>
Soumis le : mardi 29 juillet 2014 - 14:28:42
Dernière modification le : jeudi 12 juillet 2018 - 12:56:03
Document(s) archivé(s) le : mardi 25 novembre 2014 - 20:16:32

Fichier

LNAI_AMT_Finale.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Citation

Karen Fort, Gilles Adda, Benoît Sagot, Joseph Mariani, Alain Couillault. Crowdsourcing for Language Resource Development: Criticisms About Amazon Mechanical Turk Overpowering Use. Vetulani, Zygmunt and Mariani, Joseph. Human Language Technology Challenges for Computer Science and Linguistics, 8387, Springer International Publishing, pp.303-314, 2014, Lecture Notes in Computer Science, 978-3-319-08957-7. 〈10.1007/978-3-319-08958-4_25〉. 〈hal-01053047〉

Partager

Métriques

Consultations de la notice

1031

Téléchargements de fichiers

703