"Where the data are coming from?" Ethics, crowdsourcing and traceability for Big Data in Human Language Technology

Abstract : Based on experience gained from observing corpus development in HLT, the authors want to warn the Big Data community about some recent uses of human computation. For instance, the growing use in the HLT community of crowdsourcing methods, and especially of remunerated microworking crowdsourcing platforms, raises many ethical, economic and legal concerns. The authors also want to encourage certain practices, especially concerning traceability, implemented in the form of a charter, the Ethics and Big Data Charter.
Document type :
Conference papers

https://hal.archives-ouvertes.fr/hal-01078045
Contributor : Alain Couillault
Submitted on : Tuesday, October 28, 2014 - 10:10:06 AM
Last modification on : Tuesday, September 17, 2019 - 1:13:04 AM
Long-term archiving on : Thursday, January 29, 2015 - 10:12:46 AM

File

charter.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01078045, version 1

Citation

Gilles Adda, Laurent Besacier, Alain Couillault, Karen Fort, Joseph Mariani, et al. "Where the data are coming from?" Ethics, crowdsourcing and traceability for Big Data in Human Language Technology. Crowdsourcing and human computation multidisciplinary workshop, CNRS, Sep 2014, Paris, France. ⟨hal-01078045⟩
