T. Hey, S. Tansley, and K. M. Tolle, The Fourth Paradigm ??? Data-Intensive Scientific Discovery, 2009.
DOI : 10.1007/978-3-642-33299-9_1

J. Dean and S. Ghemawat, MapReduce, Communications of the ACM, vol.51, issue.1, pp.107-113, 2008.
DOI : 10.1145/1327452.1327492

M. Zaharia, M. Chowdhury, T. Das, A. Dave, J. Ma et al., Resilient Distributed Datasets, NSDI'12: The 9th USENIX Symposium on Networked Systems Design and Implementation, pp.15-28
DOI : 10.1145/2886107.2886110

G. Graefe, Encapsulation of parallelism in the volcano query processing system, " in SIGMOD '90: The, ACM SIGMOD International Conference on Management of Data, pp.102-111, 1990.

C. Baru and G. Fecteau, An overview of DB2 parallel edition, ACM SIGMOD Record, vol.24, issue.2, pp.460-462, 1995.
DOI : 10.1145/568271.223876

B. Nicolae, Understanding Vertical Scalability of I/O Virtualization for MapReduce Workloads: Challenges and Opportunities, Big- DataCloud '13: 2nd Workshop on Big Data Management in Clouds (held in conjunction with EuroPar'13), 2013.
DOI : 10.1007/978-3-642-54420-0_1

URL : https://hal.archives-ouvertes.fr/hal-00856877

J. Tan, A. Chin, Z. Z. Hu, Y. Hu, S. Meng et al., DynMR, Proceedings of the Ninth European Conference on Computer Systems, EuroSys '14, pp.1-2, 2014.
DOI : 10.1145/2592798.2592805

K. Ousterhout, R. Rasti, S. Ratnasamy, S. Shenker, and B. Chun, Making sense of performance in data analytics frameworks, NSDI'15: The 12th USENIX Conference on Networked Systems Design and Implementation, pp.293-307, 2015.

B. Nicolae, D. Moise, G. Antoniu, L. Bougé, and M. Dorier, BlobSeer: Bringing high throughput under heavy concurrency to Hadoop Map-Reduce applications, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), pp.1-12, 2010.
DOI : 10.1109/IPDPS.2010.5470433

URL : https://hal.archives-ouvertes.fr/inria-00456801

H. Li, A. Ghodsi, M. Zaharia, S. Shenker, and I. Stoica, Tachyon, Proceedings of the ACM Symposium on Cloud Computing, SOCC '14, pp.1-6
DOI : 10.1145/2670979.2670985

G. Greiner and R. Jacob, The Efficiency of MapReduce in Parallel External Memory, LATIN'12: Proceedings of the 10th Latin American International Conference on Theoretical Informatics, pp.433-445, 2012.
DOI : 10.1007/978-3-642-29344-3_37

M. W. Rahman, X. Lu, N. S. Islam, and D. K. Panda, HOMR, Proceedings of the 28th ACM international conference on Supercomputing, ICS '14, pp.33-42, 2014.
DOI : 10.1145/2597652.2597684

X. Lu, M. W. Rahman, N. Islam, D. Shankar, and D. K. Panda, Accelerating Spark with RDMA for Big Data Processing: Early Experiences, 2014 IEEE 22nd Annual Symposium on High-Performance Interconnects, pp.9-16, 2014.
DOI : 10.1109/HOTI.2014.15

A. Davidson and A. Or, Optimizing shuffle performance in spark, 2013.

B. Nicolae, On the benefits of transparent compression for costeffective cloud data storage Transactions on Large-Scale Data-and Knowledge-Centered Systems, pp.167-184, 2011.