J. Dean and S. Ghemawat, MapReduce, Communications of the ACM, vol.51, issue.1, pp.107-113, 2008.
DOI : 10.1145/1327452.1327492

H. Jin, L. Ibrahim, H. Qi, S. Cao, X. Wu et al., The mapreduce programming model and implementations, Cloud computing: Principles and Paradigms, pp.373-390, 2011.

M. Isard, M. Budiu, Y. Yu, A. Birrell, and D. Fetterly, Dryad: distributed data-parallel programs from sequential building blocks, Proceedings of the 2nd ACM European Conference on Computer Systems (EuroSys '07), pp.59-72, 2007.

B. He, W. Fang, Q. Luo, N. K. Govindaraju, and T. Wang, Mars, Proceedings of the 17th international conference on Parallel architectures and compilation techniques, PACT '08, pp.260-269, 2008.
DOI : 10.1145/1454115.1454152

C. Ranger, R. Raghuraman, A. Penmetsa, G. Bradski, and C. Kozyrakis, Evaluating MapReduce for Multi-core and Multiprocessor Systems, 2007 IEEE 13th International Symposium on High Performance Computer Architecture, pp.13-24, 2007.
DOI : 10.1109/HPCA.2007.346181

S. Ghemawat, H. Gobioff, and S. Leung, The Google file system, ACM SIGOPS Operating Systems Review, vol.37, issue.5, pp.29-43, 2003.
DOI : 10.1145/1165389.945450

Y. Kwon, M. Balazinska, B. Howe, and J. Rolia, A study of skew in mapreduce applications

J. Lin, The curse of zipf and limits to parallelization: A look at the stragglers problem in mapreduce, Proceedings of the 7th workshop on largescale distributed systems for information retrieval (LSDS-IR'09)

D. J. Dewitt and M. Stonebraker, Mapreduce: A major step backwards, 2008.

K. Wiley, A. Connolly, J. P. Gardner, S. Krughof, M. Balazinska et al., Astronomy in the cloud: Using MapReduce for image coaddition, 1010.

R. Chen, M. Yang, X. Weng, B. Choi, B. He et al., Improving large graph processing on partitioned graphs in the cloud, Proceedings of the Third ACM Symposium on Cloud Computing, SoCC '12
DOI : 10.1145/2391229.2391232

M. C. Schatz, CloudBurst: highly sensitive read mapping with MapReduce, Bioinformatics, vol.25, issue.11, pp.1363-1369, 2009.
DOI : 10.1093/bioinformatics/btp236

A. Verma, X. Llorà, D. E. Goldberg, and R. H. Campbell, Scaling Genetic Algorithms Using MapReduce, 2009 Ninth International Conference on Intelligent Systems Design and Applications, pp.13-18, 2009.
DOI : 10.1109/ISDA.2009.181

A. Y. Ng, G. Bradski, C. Chu, K. Olukotun, S. K. Kim et al., MapReduce for machine learning on multicore, Proceedings of the twentieth Annual Conference on Neural Information Processing Systems (NIPS' 06), pp.281-288, 2006.

J. Lin and M. Schatz, Design patterns for efficient graph algorithms in MapReduce, Proceedings of the Eighth Workshop on Mining and Learning with Graphs, MLG '10, pp.78-85
DOI : 10.1145/1830252.1830263

S. Ibrahim, H. Jin, L. Lu, S. Wu, B. He et al., LEEN: Locality/Fairness-Aware Key Partitioning for MapReduce in the Cloud, 2010 IEEE Second International Conference on Cloud Computing Technology and Science, pp.17-24, 2010.
DOI : 10.1109/CloudCom.2010.25

R. Jain, D. Chiu, and W. Hawe, A quantitative measure of fairness and discrimination for resource allocation in shared computer systems

S. Ibrahim, H. Jin, L. Lu, L. Qi, S. Wu et al., Evaluating MapReduce on Virtual Machines: The Hadoop Case, Proceedings of the 1st International Conference on Cloud Computing (CLOUDCOM'09), pp.519-528, 2009.
DOI : 10.1007/978-3-642-10665-1_47

S. Ibrahim, H. Jin, B. Cheng, H. Cao, S. Wu et al., CLOUDLET, Proceedings of the 18th ACM international symposium on High performance distributed computing, HPDC '09, pp.65-66, 2009.
DOI : 10.1145/1551609.1551624

M. Zaharia, D. Borthakur, J. S. Sarma, K. Elmeleegy, S. Shenker et al., Delay scheduling, Proceedings of the 5th European conference on Computer systems, EuroSys '10, pp.265-278, 2010.
DOI : 10.1145/1755913.1755940

S. Ibrahim, H. Jin, L. Lu, B. He, G. Antoniu et al., Maestro: Replica-Aware Map Scheduling for MapReduce, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012), 2012.
DOI : 10.1109/CCGrid.2012.122

URL : https://hal.archives-ouvertes.fr/hal-00670813

S. Ibrahim, H. Jin, L. Lu, B. He, and S. Wu, Adaptive Disk I/O Scheduling for MapReduce in Virtualized Environment, 2011 International Conference on Parallel Processing, pp.335-344, 2011.
DOI : 10.1109/ICPP.2011.86

R. K. Menon, G. P. Bhat, and M. C. Schatz, Rapid parallel genome indexing with MapReduce, Proceedings of the second international workshop on MapReduce and its applications, MapReduce '11, pp.51-58, 2011.
DOI : 10.1145/1996092.1996104

J. Ekanayake, S. Pallickara, and G. Fox, MapReduce for Data Intensive Scientific Analyses, 2008 IEEE Fourth International Conference on eScience, pp.277-284, 2008.
DOI : 10.1109/eScience.2008.59

T. Gunarathne, T. Wu, J. Qiu, and G. Fox, MapReduce in the Clouds for Science, 2010 IEEE Second International Conference on Cloud Computing Technology and Science, pp.565-572, 2010.
DOI : 10.1109/CloudCom.2010.107

Y. Ganjisaffar, T. Debeauvais, S. Javanmardi, R. Caruana, and C. V. Lopes, Distributed tuning of machine learning algorithms using MapReduce Clusters, Proceedings of the Third Workshop on Large Scale Data Mining Theory and Applications, LDMTA '11, pp.1-2, 2011.
DOI : 10.1145/2002945.2002947

S. Blanas, J. M. Patel, V. Ercegovac, J. Rao, E. J. Shekita et al., A comparison of join algorithms for log processing in MaPreduce, Proceedings of the 2010 international conference on Management of data, SIGMOD '10, pp.975-986, 2010.
DOI : 10.1145/1807167.1807273

D. Logothetis, C. Trezzo, K. C. Webb, and K. Yocum, In-situ mapreduce for log processing, Proceedings of the 2011 USENIX conference on USENIX annual technical conference, pp.9-9, 2011.

S. Seo, I. Jang, K. Woo, I. Kim, J. Kim et al., HPMR: Prefetching and pre-shuffling in shared MapReduce computation environment, 2009 IEEE International Conference on Cluster Computing and Workshops
DOI : 10.1109/CLUSTR.2009.5289171

Y. Su, P. Chen, J. Chang, and C. Shieh, Variable-sized map and locality-aware reduce on public-resource grids, Future Generation Computer Systems, vol.27, issue.6, pp.843-849, 2011.
DOI : 10.1016/j.future.2010.09.001

D. Dewitt and J. Gray, Parallel database systems: the future of high performance database systems, Communications of the ACM, vol.35, issue.6, pp.85-98, 1992.
DOI : 10.1145/129888.129894

S. Chen and S. W. Schlosser, Map-reduce meets wider varieties of applications, Intel Research Pittsburgh, 2008.

G. Ananthanarayanan, S. Kandula, A. Greenberg, I. Stoica, Y. Lu et al., Reining in the outliers in map-reduce clusters using mantri, Proceedings of the 9th USENIX conference on Operating systems design and implementation (OSDI'10), pp.1-16, 2010.

Y. Kwon, M. Balazinska, B. Howe, and J. Rolia, Skew-resistant parallel processing of feature-extracting scientific user-defined functions, Proceedings of the 1st ACM symposium on Cloud computing, SoCC '10
DOI : 10.1145/1807128.1807140

B. Gufler, N. Augsten, A. Reiser, and A. Kemper, Load Balancing in MapReduce Based on Scalable Cardinality Estimates, 2012 IEEE 28th International Conference on Data Engineering
DOI : 10.1109/ICDE.2012.58

Y. Kwon, M. Balazinska, B. Howe, and J. Rolia, SkewTune, Proceedings of the 2012 international conference on Management of Data, SIGMOD '12
DOI : 10.1145/2213836.2213840

B. He, M. Yang, Z. Guo, R. Chen, W. Lin et al., Wave computing in the cloud, Proceedings of the 12th conference on Hot topics in operating systems (HotOS'09)

B. He, M. Yang, Z. Guo, R. Chen, B. Su et al., Comet, Proceedings of the 1st ACM symposium on Cloud computing, SoCC '10
DOI : 10.1145/1807128.1807139

S. Ibrahim, B. He, and H. Jin, Towards Pay-As-You-Consume Cloud Computing, 2011 IEEE International Conference on Services Computing, pp.370-377, 2011.
DOI : 10.1109/SCC.2011.38