C. Olston, B. Reed, U. Srivastava, R. Kumar, and A. Tomkins, Pig latin, Proceedings of the 2008 ACM SIGMOD international conference on Management of data , SIGMOD '08, pp.1099-1110, 2008.
DOI : 10.1145/1376616.1376726

J. Ekanayake, H. Li, B. Zhang, T. Gunarathne, S. H. Bae et al., Twister, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, HPDC '10, 2010.
DOI : 10.1145/1851476.1851593

G. Fedak, H. He, and F. Cappello, BitDew: A programmable environment for large-scale data management and distribution, 2008 SC, International Conference for High Performance Computing, Networking, Storage and Analysis, pp.1-12, 2008.
DOI : 10.1109/SC.2008.5213939

URL : https://hal.archives-ouvertes.fr/inria-00216126

A. Simonet, G. Fedak, and M. Ripeanu, Active Data: A programming model to manage data life cycle across heterogeneous systems and infrastructures, Future Generation Computer Systems, vol.53, 2015.
DOI : 10.1016/j.future.2015.05.015

URL : https://hal.archives-ouvertes.fr/hal-01241491

B. Tang, M. Moca, S. Chevalier, H. He, and G. Fedak, Towards MapReduce for Desktop Grid Computing, 2010 International Conference on P2P, Parallel, Grid, Cloud and Internet Computing, pp.193-200, 2010.
DOI : 10.1109/3PGCIC.2010.33

URL : https://hal.archives-ouvertes.fr/hal-00687553

M. Moca, G. Silaghi, and G. Fedak, Distributed Results Checking for MapReduce in Volunteer Computing, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum, pp.1847-1854, 2011.
DOI : 10.1109/IPDPS.2011.351

J. C. Anjos, G. Fedak, and C. F. Geyer, BIGhybrid: a simulator for MapReduce applications in hybrid distributed infrastructures validated with the Grid5000 experimental platform, Concurrency and Computation: Practice and Experience, 2015.
DOI : 10.1002/cpe.3665

URL : https://hal.archives-ouvertes.fr/hal-01239382

P. Bhatotia, A. Wieder, R. Rodrigues, U. A. Acar, and R. Pasquin, Incoop, Proceedings of the 2nd ACM Symposium on Cloud Computing, SOCC '11, p.7, 2011.
DOI : 10.1145/2038916.2038923

D. Peng and F. Dabek, Large-scale incremental processing using distributed transactions and notifications, Proceedings of the 9th USENIX Symposium on Operating Systems Design and Implementation, 2010.

B. Lohrmann, D. Warneke, and O. Kao, Massively-parallel stream processing under QoS constraints with Nephele, Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing, HPDC '12, pp.271-282, 2012.
DOI : 10.1145/2287076.2287117

T. Condie, N. Conway, P. Alvaro, J. M. Hellerstein, K. Elmeleegy et al., Mapreduce online, Proceedings of the 7th USENIX conference on Networked systems design and implementation, pp.21-21, 2010.

J. C. Corbett, Spanner, Proceedings of OSDI, 2012.
DOI : 10.1145/2518037.2491245

H. Lin, X. Ma, J. Archuleta, W. C. Feng, M. Gardner et al., MOON, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, HPDC '10, pp.95-106, 2010.
DOI : 10.1145/1851476.1851489

URL : https://hal.archives-ouvertes.fr/in2p3-00024076

F. Marozzo, D. Talia, and P. Trunfio, P2P-MapReduce: Parallel data processing in dynamic Cloud environments, Journal of Computer and System Sciences, vol.78, issue.5, pp.1382-1402, 2012.
DOI : 10.1016/j.jcss.2011.12.021

B. Tang, H. He, and G. Fedak, HybridMR: a new approach for hybrid MapReduce combining desktop grid and cloud infrastructures, Concurrency and Computation: Practice and Experience, vol.20, issue.4, 2015.
DOI : 10.1002/cpe.3515

URL : https://hal.archives-ouvertes.fr/hal-01239299

G. Antoniu, all: Scalable Data Management for MapReduce-Based Data-Intensive Applications: a View for Cloud and Hybrid Infrastructures, International Journal on Cloud Computing, vol.2, issue.2-3, 2013.