D. Jeffrey, Large-scale distributed systems at google: Current systems and future directions, Keynote speach at the 3rd ACM SIGOPS International Workshop on Large Scale Distributed Systems and Middleware (LADIS'09, 2009.

A. Qureshi, Power-demand routing in massive geo-distributed systems, Ph.D.dissertation, MIT, 2010.

T. Gunarathne, T. Wu, J. Qiu, and G. Fox, MapReduce in the Clouds for Science, 2010 IEEE Second International Conference on Cloud Computing Technology and Science, pp.565-572, 2010.
DOI : 10.1109/CloudCom.2010.107

G. Lee, B. Chun, and H. Katz, Heterogeneity-aware resource allocation and scheduling in the cloud, Proceedings of the 3rd USENIX Conference on Hot Topics in Cloud Computing (HotCloud'11). USENIX Association, pp.4-4, 2011.

M. Zaharia, A. Konwinski, A. D. Joseph, R. Katz, and I. Stoica, Improving mapreduce performance in heterogeneous environments, Proceedings of the 8th USENIX Conference on Operating Systems Design and Implementation (OSDI'08). USENIX Association, pp.29-42, 2008.

J. Lin and M. Schatz, Design patterns for efficient graph algorithms in MapReduce, Proceedings of the Eighth Workshop on Mining and Learning with Graphs, MLG '10, pp.78-85, 2010.
DOI : 10.1145/1830252.1830263

K. Wiley, A. Connolly, J. P. Gardner, S. Krughof, M. Balazinska et al., Astronomy in the cloud: Using MapReduce for image coaddition, Proceedings of the 20th Annual Conference on Astronomical Data Analysis Software and Systems (ADASS'11), pp.93-96, 2011.

S. Ibrahim, H. Jin, L. Lu, B. He, G. Antoniu et al., Handling partitioning skew in mapreduce using leen Peer-to-Peer Networking and Applications Towards efficient power management in mapreduce: Investigation of cpufrequencies scaling on power efficiency in hadoop, Proceedings of the 1st Workshop on Adaptive Resource Management and Scheduling for Cloud Computing (ARMS-CC'14), pp.409-424, 2013.

T. Wirtz and R. Ge, Improving MapReduce energy efficiency for computation intensive workloads, 2011 International Green Computing Conference and Workshops, pp.1-8, 2011.
DOI : 10.1109/IGCC.2011.6008564

R. T. Kaushik and M. Bhandarkar, Greenhdfs: towards an energy-conserving, storage-efficient, hybrid hadoop compute cluster, Proceedings of the 2010 International Conference on Power aware computing and systems (HotPower'10). USENIX Association, pp.1-9, 2010.

H. Amur, J. Cipar, V. Gupta, G. R. Ganger, M. A. Kozuch et al., Robust and flexible power-proportional storage, Proceedings of the 1st ACM symposium on Cloud computing, SoCC '10, pp.217-228, 2010.
DOI : 10.1145/1807128.1807164

N. Vasi´cvasi´c, M. Barisits, V. Salzgeber, and D. Kostic, Making cluster applications energy-aware, Proceedings of the 1st Workshop on Automated control for datacenters and clouds (ACDC'09, pp.37-42, 2009.

W. Lang and J. M. Patel, Energy management for MapReduce clusters, Proceedings of the VLDB Endowment, pp.129-139, 2010.
DOI : 10.14778/1920841.1920862

M. Cardosa, A. Singh, H. Pucha, and A. Chandra, Exploiting spatio-temporal tradeoffs for energy-aware mapreduce in the cloud, Proceedings of the 2011 IEEE International Conference on Cloud Computing, pp.251-258, 2011.

I. Goiri, K. Le, T. D. Nguyen, J. Guitart, J. Torres et al., GreenHadoop, Proceedings of the 7th ACM european conference on Computer Systems, EuroSys '12, pp.57-70, 2012.
DOI : 10.1145/2168836.2168843

Y. Jégou, S. Lantéri, J. Leduc, N. Melab, G. Mornet et al., Grid'5000: a large scale and highly reconfigurable experimental Grid testbed, International Journal of High Performance Computing Applications, pp.481-494, 2006.

M. Zaharia, M. Chowdhury, M. J. Franklin, S. Shenker, and I. Stoica, Spark: Cluster computing with working sets, Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing (HotCloud'10). USENIX Association, pp.10-10, 2010.

H. Jin, S. Ibrahim, L. Qi, H. Cao, S. Wu et al., The MapReduce Programming Model and Implementations, Cloud computing: Principles and Paradigms, pp.373-390, 2011.
DOI : 10.1002/9780470940105.ch14

D. Huang, X. Shi, S. Ibrahim, L. Lu, H. Liu et al., MR-scope, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, HPDC '10, pp.849-855, 2010.
DOI : 10.1145/1851476.1851598

F. Dinu and T. E. Ng, Understanding the effects and implications of compute node related failures in hadoop, Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing, HPDC '12, pp.187-198, 2012.
DOI : 10.1145/2287076.2287108

G. Ananthanarayanan, S. Kandula, A. Greenberg, I. Stoica, Y. Lu et al., Reining in the outliers in map-reduce clusters using mantri, Proceedings of the 9th USENIX Conference on Operating Systems Design and Implementation (OSDI'10). USENIX Association, pp.1-16, 2010.

Q. Chen, C. Liu, and Z. Xiao, Improving MapReduce performance uisng smart speculative execution strategy, IEEE Transactions on Computers, pp.29-42, 2014.

G. Ananthanarayanan, A. Ghodsi, S. Shenker, and I. Stoica, Effective straggler mitigation: Attack of the clones, Proceedings of the 10th USENIX Symposium on Networked Systems Design and Implementation (NSDI'13). USENIX Association, pp.185-198, 2013.

J. Kim, J. Chou, and D. Rotem, Energy Proportionality and Performance in Data Parallel Computing Clusters, Proceedings of the 23rd International Conference on Scientific and Statistical Database Management (SSDBM'11, pp.414-431, 2011.
DOI : 10.1145/1740390.1740405

J. Leverich and C. Kozyrakis, On the energy (in)efficiency of Hadoop clusters, ACM SIGOPS Operating Systems Review, vol.44, issue.1, pp.61-65, 2010.
DOI : 10.1145/1740390.1740405

E. Thereska, A. Donnelly, and D. Narayanan, Sierra, Proceedings of the sixth conference on Computer systems, EuroSys '11, pp.169-182, 2011.
DOI : 10.1145/1966445.1966461

Y. Chen, L. Keys, and R. H. Katz, Towards energy efficient MapReduce, EECS Department, 2009.

Y. Chen, A. Ganapathi, and R. H. Katz, To compress or not to compress - compute vs. IO tradeoffs for mapreduce energy efficiency, Proceedings of the first ACM SIGCOMM workshop on Green networking, Green Networking '10, pp.23-28, 2010.
DOI : 10.1145/1851290.1851296

Y. Chen, S. Alspaugh, D. Borthakur, and R. Katz, Energy efficiency for large-scale MapReduce workloads with significant interactive analysis, Proceedings of the 7th ACM european conference on Computer Systems, EuroSys '12, pp.43-56, 2012.
DOI : 10.1145/2168836.2168842

S. Ibrahim, T. Phan, A. Carpen-amarie, H. Chihoub, D. Moise et al., Governing energy consumption in Hadoop through CPU frequency scaling: An analysis, Future Generation Computer Systems, vol.54, 2015.
DOI : 10.1016/j.future.2015.01.005

URL : https://hal.archives-ouvertes.fr/hal-01166252

Y. Kwon, M. Balazinska, B. Howe, and J. Rolia, A study of skew in mapreduce applications, Proceedings of the 5th Open Cirrus Summit, 2011.

S. Ibrahim, H. Jin, L. Lu, B. He, G. Antoniu et al., Maestro: Replica-Aware Map Scheduling for MapReduce, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012), pp.59-72, 2012.
DOI : 10.1109/CCGrid.2012.122

URL : https://hal.archives-ouvertes.fr/hal-00670813