Crowdsourcing for Language Resource Development: Criticisms About Amazon Mechanical Turk Overpowering Use

This article is a position paper about Amazon Mechanical Turk, the use of which has been steadily growing in language processing in the past few years. According to the mainstream opinion expressed in articles of the domain, this type of on-line working platforms allows to develop quickly all sorts of quality language resources, at a very low price, by people doing that as a hobby. We shall demonstrate here that the situation is far from being that ideal. Our goal here is manifold: 1- to inform researchers, so that they can make their own choices, 2- to develop alternatives with the help of funding agencies and scientific associations, 3- to propose practical and organizational solutions in order to improve language resources development, while limiting the risks of ethical and legal issues without letting go price or quality, 4- to introduce an Ethics and Big Data Charter for the documentation of language resource

Mots clés

Amazon Mechanical Turk Language resources Ethics

Domaines

Traitement du texte et du document

Fichier principal

LNAI_AMT_Finale.pdf (367.35 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Karën Fort : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01053047

Soumis le : mardi 29 juillet 2014-14:28:42

Dernière modification le : mercredi 7 février 2024-03:34:08

Archivage à long terme le : mardi 25 novembre 2014-20:16:32

Dates et versions

hal-01053047 , version 1 (29-07-2014)

Identifiants

HAL Id : hal-01053047 , version 1
DOI : 10.1007/978-3-319-08958-4_25

Citer

Karen Fort, Gilles Adda, Benoît Sagot, Joseph Mariani, Alain Couillault. Crowdsourcing for Language Resource Development: Criticisms About Amazon Mechanical Turk Overpowering Use. Vetulani, Zygmunt and Mariani, Joseph. Human Language Technology Challenges for Computer Science and Linguistics, 8387, Springer International Publishing, pp.303-314, 2014, Lecture Notes in Computer Science, 978-3-319-08957-7. ⟨10.1007/978-3-319-08958-4_25⟩. ⟨hal-01053047⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-PARIS7 UNIV-RENNES1 CNRS INRIA IRISA LIMSI UNIV-LORRAINE INRIA2 CAMPUS-AAR AAI LORIA LORIA-NLPKD UR1-MATH-STIC UR1-UFR-ISTIC UNIV-ROCHELLE UNIV-RENNES SORBONNE-UNIVERSITE UR1-MATH-NUM LISN GS-SPORT-HUMAN-MOVEMENT

571 Consultations

1123 Téléchargements