Puma: Purdue Mapreduce benchmarks suite. ECE Technical Reports, 2012. ,
Supporting fault tolerance in a data-intensive computing middleware, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), pp.1-12, 2010. ,
DOI : 10.1109/IPDPS.2010.5470462
Facebook has the world's largest Hadoop clus- ter! http, 2015. ,
Large-scale distributed systems at google: Current systems and future directions, Keynote speech at the 3rd ACM SIGOPS International Workshop on Large Scale Distributed Systems and Middleware (LADIS), 2009. ,
MapReduce, Proceedings of the 6th USENIX conference on Symposium on Opearting Systems Design & Implementation (OSDI '04), pp.137-150, 2004. ,
DOI : 10.1145/1327452.1327492
MapReduce, Communications of the ACM, vol.51, issue.1, pp.107-113, 2008. ,
DOI : 10.1145/1327452.1327492
Understanding the effects and implications of compute node related failures in hadoop, Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing, HPDC '12, pp.187-198 ,
DOI : 10.1145/2287076.2287108
Above the clouds: A berkeley view of cloud computing ,
13. Derek Gottfrid. Self-service, prorated supercomputing fun! http://open.blogs.nytimes.com/self-service-prorated-super- computing-fun, p.13, 2007. ,
MR-scope, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, HPDC '10, pp.849-855, 2010. ,
DOI : 10.1145/1851476.1851598
Towards Pay-As-You-Consume Cloud Computing, 2011 IEEE International Conference on Services Computing, pp.370-377, 2011. ,
DOI : 10.1109/SCC.2011.38
Maestro: Replica-Aware Map Scheduling for MapReduce, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012), pp.59-72, 2012. ,
DOI : 10.1109/CCGrid.2012.122
URL : https://hal.archives-ouvertes.fr/hal-00670813
Evaluating MapReduce on Virtual Machines: The Hadoop Case, Proceedings of the 1st International Conference on Cloud Computing (CLOUDCOM'09), pp.519-528, 2009. ,
DOI : 10.1007/978-3-642-10665-1_47
The mapreduce programming model and implementations. Cloud Computing: Principles and Paradigms, pp.373-390, 2011. ,
Trojan data layouts, Proceedings of the 2nd ACM Symposium on Cloud Computing, SOCC '11, pp.1-21, 2011. ,
DOI : 10.1145/2038916.2038937
Making cloud intermediate data fault-tolerant, the 1st ACM symposium on Cloud computing, pp.181-192, 2010. ,
Companies are spending a lot on Big Data. http://sites.tcs.com/ big-data-study/spending-on-big-data, 2015. ,
Stateful bulk processing for incremental analytics, Proceedings of the 1st ACM symposium on Cloud computing, SoCC '10, pp.51-62, 2010. ,
DOI : 10.1145/1807128.1807138
Runtime measurements in the cloud, Proceedings of the VLDB Endowment, vol.3, issue.1-2, pp.460-471, 2010. ,
DOI : 10.14778/1920841.1920902
Performance issues of heterogeneous hadoop clusters in cloud computing. ArXiv e-prints, 2012. ,
Hive - a petabyte scale data warehouse using Hadoop, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010), pp.996-1005, 2010. ,
DOI : 10.1109/ICDE.2010.5447738
Delay scheduling, Proceedings of the 5th European conference on Computer systems, EuroSys '10, pp.265-278, 2010. ,
DOI : 10.1145/1755913.1755940
Improving mapreduce performance in heterogeneous environments, Proceedings of the 8th USENIX Conference on Operating Systems Design and Implementation (OSDI'08), pp.29-42, 2008. ,
Adaptive Failure Detection via Heartbeat under Hadoop, 2011 IEEE Asia-Pacific Services Computing Conference, pp.231-238, 2011. ,
DOI : 10.1109/APSCC.2011.46