J. Dean and S. Ghemawat, MapReduce, Communications of the ACM, vol.51, issue.1, pp.107-113, 2008.
DOI : 10.1145/1327452.1327492

J. Dean, Large-scale distributed systems at google: Current systems and future directions, in: Keynote speach at The 3rd ACM SIGOPS International Workshop on Large Scale Distributed Systems and Middleware, 2009.

G. Ananthanarayanan, S. Agarwal, S. Kandula, A. Greenberg, I. Stoica et al., Scarlett, Proceedings of the sixth conference on Computer systems, EuroSys '11, pp.287-300, 2011.
DOI : 10.1145/1966445.1966472

M. Zaharia, D. Borthakur, J. S. Sarma, K. Elmeleegy, S. Shenker et al., Delay scheduling, Proceedings of the 5th European conference on Computer systems, EuroSys '10, pp.265-278, 2010.
DOI : 10.1145/1755913.1755940

C. Hsu, K. D. Slagter, and Y. Chung, Locality and loading aware virtual machine mapping techniques for optimizing communications in MapReduce applications, Future Generation Computer Systems, vol.53, pp.43-54, 2015.
DOI : 10.1016/j.future.2015.04.006

O. Yildiz, S. Ibrahim, T. A. Phuong, and G. Antoniu, Chronos: Failure-Aware Scheduling in Shared Hadoop Clusters URL: https, the 2015 IEEE International Conference on Big Data, 2015.

H. Jin, S. Ibrahim, L. Qi, H. Cao, S. Wu et al., The mapreduce programming model and implementations, Cloud Computing: Principles and Paradigms, pp.373-390, 2011.

F. Dinu and T. E. Ng, Understanding the effects and implications of compute node related failures in hadoop, Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing, HPDC '12, pp.187-198, 2012.
DOI : 10.1145/2287076.2287108

D. Huang, X. Shi, S. Ibrahim, L. Lu, H. Liu et al., MR-scope, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, HPDC '10, pp.849-855, 2010.
DOI : 10.1145/1851476.1851598

K. Salem and H. Garcia-molina, Checkpointing memory-resident databases, [1989] Proceedings. Fifth International Conference on Data Engineering, pp.452-462, 1989.
DOI : 10.1109/ICDE.1989.47249

S. Ibrahim, T. Phan, A. Carpen-amarie, H. Chihoub, D. Moise et al., Governing energy consumption in Hadoop through CPU frequency scaling: An analysis, Future Generation Computer Systems, vol.54, 2015.
DOI : 10.1016/j.future.2015.01.005

URL : https://hal.archives-ouvertes.fr/hal-01166252

Y. Jégou, S. Lantéri, J. Leduc, N. Melab, G. Mornet et al., Grid'5000: a large scale and highly reconfigurable experimental Grid testbed, International Journal of High Performance Computing Applications, vol.20, pp.481-494, 2006.

F. Ahmad, S. Lee, M. Thottethodi, and T. Vijaykumar, Puma: Purdue Mapreduce benchmarks suite, ECE Technical Reports. Paper, vol.437, 2012.

Y. Chen, S. Alspaugh, and R. Katz, Interactive analytical processing in big data systems, Proceedings of the VLDB Endowment, pp.1802-1813, 2012.
DOI : 10.14778/2367502.2367519

M. Isard, V. Prabhakaran, J. Currey, U. Wieder, K. Talwar et al., Quincy, Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles, SOSP '09, pp.261-276, 2009.
DOI : 10.1145/1629575.1629601

V. K. Vavilapalli, A. C. Murthy, C. Douglas, S. Agarwal, M. Konar et al., Apache Hadoop YARN, Proceedings of the 4th annual Symposium on Cloud Computing, SOCC '13, pp.1-16, 2013.
DOI : 10.1145/2523616.2523633

K. Ren, Y. Kwon, M. Balazinska, and B. Howe, Hadoop's adolescence, Proc. VLDB Endow, pp.853-864, 2013.
DOI : 10.14778/2536206.2536213

J. Quiané-ruiz, C. Pinkel, J. Schad, and J. Dittrich, RAFTing MapReduce: Fast recovery on the RAFT, 2011 IEEE 27th International Conference on Data Engineering, pp.589-600, 2011.
DOI : 10.1109/ICDE.2011.5767877

S. Ibrahim, T. A. Phuong, and G. Antoniu, An Eye on the Elephant in the Wild: A Performance Evaluation of Hadoop???s Schedulers Under Failures, Workshop on Adaptive Resource Management and Scheduling for Cloud Computing (ARMS-CC-2015), held in conjunction with PODC'15, 2015.
DOI : 10.1007/978-3-319-28448-4_11

J. Schad, J. Dittrich, and J. Quiané-ruiz, Runtime measurements in the cloud, Proceedings of the VLDB Endowment, vol.3, issue.1-2, pp.460-471, 2010.
DOI : 10.14778/1920841.1920902

S. Venkataraman, A. Panda, G. Ananthanarayanan, M. J. Franklin, and I. Stoica, The power of choice in data-aware cluster scheduling, Proceedings of the 11th USENIX conference on Operating Systems Design and Implementation, USENIX Association, pp.301-316, 2014.

F. Dinu and T. Ng, RCMP: Enabling Efficient Recomputation Based Failure Resilience for Big Data Analytics, 2014 IEEE 28th International Parallel and Distributed Processing Symposium, pp.962-971
DOI : 10.1109/IPDPS.2014.102

Y. Wang, J. Tan, W. Yu, X. Meng, and L. Zhang, Preemptive reducetask scheduling for fair and fast job completion, Proceedings of the 10th International Conference on Autonomic Computing, ICAC, 2013.

L. Liu, Y. Zhou, M. Liu, G. Xu, X. Chen et al., Preemptive Hadoop Jobs Scheduling under a Deadline, 2012 Eighth International Conference on Semantics, Knowledge and Grids, pp.2012-72
DOI : 10.1109/SKG.2012.40

M. Pastorelli, M. Dell-'amico, and P. Michiardi, Os-assisted task preemption for hadoop, arXiv preprint arXiv:1402, p.2107, 2014.

G. Ananthanarayanan, C. Douglas, R. Ramakrishnan, S. Rao, and I. Stoica, True elasticity in multi-tenant data-intensive compute clusters, Proceedings of the Third ACM Symposium on Cloud Computing, SoCC '12, pp.2012-2036
DOI : 10.1145/2391229.2391253

S. Ibrahim, H. Jin, L. Lu, B. He, G. Antoniu et al., Maestro: Replica-Aware Map Scheduling for MapReduce, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012), pp.2012-435
DOI : 10.1109/CCGrid.2012.122

URL : https://hal.archives-ouvertes.fr/hal-00670813

S. Ibrahim, H. Jin, L. Lu, S. Wu, B. He et al., LEEN: Locality/Fairness-Aware Key Partitioning for MapReduce in the Cloud, 2010 IEEE Second International Conference on Cloud Computing Technology and Science, pp.17-24, 2010.
DOI : 10.1109/CloudCom.2010.25