F. Ahmad, S. Lee, M. Thottethodi, and T. Vijaykumar, Puma: Purdue Mapreduce benchmarks suite. ECE Technical Reports, 2012.

T. Bicer, W. Jiang, and G. Agrawal, Supporting fault tolerance in a data-intensive computing middleware, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), pp.1-12, 2010.
DOI : 10.1109/IPDPS.2010.5470462

D. Borthakur, Facebook has the world's largest Hadoop clus- ter! http, 2015.

J. Dean, Large-scale distributed systems at google: Current systems and future directions, Keynote speech at the 3rd ACM SIGOPS International Workshop on Large Scale Distributed Systems and Middleware (LADIS), 2009.

J. Dean and S. Ghemawat, MapReduce, Proceedings of the 6th USENIX conference on Symposium on Opearting Systems Design & Implementation (OSDI '04), pp.137-150, 2004.
DOI : 10.1145/1327452.1327492

J. Dean and S. Ghemawat, MapReduce, Communications of the ACM, vol.51, issue.1, pp.107-113, 2008.
DOI : 10.1145/1327452.1327492

F. Dinu and T. S. Ng, Understanding the effects and implications of compute node related failures in hadoop, Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing, HPDC '12, pp.187-198
DOI : 10.1145/2287076.2287108

A. Fox, R. Griffith, . Joseph, . Katz, . Konwinski et al., Above the clouds: A berkeley view of cloud computing

R. Ucb and . Eecs, 13. Derek Gottfrid. Self-service, prorated supercomputing fun! http://open.blogs.nytimes.com/self-service-prorated-super- computing-fun, p.13, 2007.

D. Huang, X. Shi, S. Ibrahim, L. Lu, H. Liu et al., MR-scope, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, HPDC '10, pp.849-855, 2010.
DOI : 10.1145/1851476.1851598

S. Ibrahim, B. He, and H. Jin, Towards Pay-As-You-Consume Cloud Computing, 2011 IEEE International Conference on Services Computing, pp.370-377, 2011.
DOI : 10.1109/SCC.2011.38

S. Ibrahim, H. Jin, L. Lu, B. He, G. Antoniu et al., Maestro: Replica-Aware Map Scheduling for MapReduce, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012), pp.59-72, 2012.
DOI : 10.1109/CCGrid.2012.122

URL : https://hal.archives-ouvertes.fr/hal-00670813

S. Ibrahim, H. Jin, L. Lu, L. Qi, S. Wu et al., Evaluating MapReduce on Virtual Machines: The Hadoop Case, Proceedings of the 1st International Conference on Cloud Computing (CLOUDCOM'09), pp.519-528, 2009.
DOI : 10.1007/978-3-642-10665-1_47

H. Jin, S. Ibrahim, L. Qi, H. Cao, S. Wu et al., The mapreduce programming model and implementations. Cloud Computing: Principles and Paradigms, pp.373-390, 2011.

A. Jindal, J. Quiané-ruiz, and J. Dittrich, Trojan data layouts, Proceedings of the 2nd ACM Symposium on Cloud Computing, SOCC '11, pp.1-21, 2011.
DOI : 10.1145/2038916.2038937

Y. Steven, I. Ko, B. Hoque, I. Cho, and . Gupta, Making cloud intermediate data fault-tolerant, the 1st ACM symposium on Cloud computing, pp.181-192, 2010.

E. Lai, Companies are spending a lot on Big Data. http://sites.tcs.com/ big-data-study/spending-on-big-data, 2015.

D. Logothetis, C. Olston, B. Reed, C. Kevin, K. Webb et al., Stateful bulk processing for incremental analytics, Proceedings of the 1st ACM symposium on Cloud computing, SoCC '10, pp.51-62, 2010.
DOI : 10.1145/1807128.1807138

J. Schad, J. Dittrich, and J. , Runtime measurements in the cloud, Proceedings of the VLDB Endowment, vol.3, issue.1-2, pp.460-471, 2010.
DOI : 10.14778/1920841.1920902

B. Thirumala-rao, N. V. Sridevi, V. K. Reddy, and L. S. Reddy, Performance issues of heterogeneous hadoop clusters in cloud computing. ArXiv e-prints, 2012.

A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka et al., Hive - a petabyte scale data warehouse using Hadoop, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010), pp.996-1005, 2010.
DOI : 10.1109/ICDE.2010.5447738

M. Zaharia, D. Borthakur, J. S. Sarma, K. Elmeleegy, S. Shenker et al., Delay scheduling, Proceedings of the 5th European conference on Computer systems, EuroSys '10, pp.265-278, 2010.
DOI : 10.1145/1755913.1755940

M. Zaharia, A. Konwinski, A. D. Joseph, R. Katz, and I. Stoica, Improving mapreduce performance in heterogeneous environments, Proceedings of the 8th USENIX Conference on Operating Systems Design and Implementation (OSDI'08), pp.29-42, 2008.

H. Zhu and H. Chen, Adaptive Failure Detection via Heartbeat under Hadoop, 2011 IEEE Asia-Pacific Services Computing Conference, pp.231-238, 2011.
DOI : 10.1109/APSCC.2011.46