Benchmarking Dependability of MapReduce Systems

Abstract : MapReduce is a popular programming model for distributed data processing. Extensive research has been con- ducted on the reliability of MapReduce, ranging from adaptive and on-demand fault-tolerance to new fault-tolerance models. However, realistic benchmarks are still missing to analyze and compare the effectiveness of these proposals. To date, most MapReduce fault-tolerance solutions have been evaluated using microbenchmarks in an ad-hoc and overly simplified setting, which may not be representative of real-world applications. This paper presents MRBS, a comprehensive benchmark suite for evaluating the dependability of MapReduce systems. MRBS includes five benchmarks covering several application domains and a wide range of execution scenarios such as data-intensive vs. compute-intensive applications, or batch applications vs. online interactive applications. MRBS allows to inject various types of faults at different rates. It also considers different application workloads and dataloads, and produces extensive reliability, availability and performance statistics. We illustrate the use of MRBS with Hadoop clusters running on Amazon EC2, and on a private cloud.
Type de document :
Communication dans un congrès
The 31st IEEE International Symposium on Reliable Distributed Systems (SRDS), Oct 2012, Irvine, California, United States. 2012
Liste complète des métadonnées

https://hal.inria.fr/hal-00950645
Contributeur : Sara Bouchenak <>
Soumis le : vendredi 21 février 2014 - 17:27:09
Dernière modification le : jeudi 11 janvier 2018 - 06:21:05

Identifiants

  • HAL Id : hal-00950645, version 1

Collections

Citation

Amit Sangroya, Damián Serrano, Sara Bouchenak. Benchmarking Dependability of MapReduce Systems. The 31st IEEE International Symposium on Reliable Distributed Systems (SRDS), Oct 2012, Irvine, California, United States. 2012. 〈hal-00950645〉

Partager

Métriques

Consultations de la notice

157