
On Understanding the Energy Impact of Speculative Execution in Hadoop

Tien-Dat Phan 1 Shadi Ibrahim 1 Gabriel Antoniu 1 Luc Bougé 1
1 KerData - Scalable Storage for Clouds and Beyond
IRISA-D1 - SYSTÈMES LARGE ÉCHELLE, Inria Rennes – Bretagne Atlantique
Abstract : Hadoop has emerged as an important system for large-scale data analysis. Speculative execution is a key feature in Hadoop that is extensively leveraged in clouds: it masks slow tasks (i.e., stragglers), which result from resource contention and heterogeneity in clouds, by launching speculative task copies on other machines. However, speculative execution is not cost-free and may result in performance degradation and extra resource and energy consumption. While prior literature has been dedicated to improving straggler detection to cope with the inevitable heterogeneity in clouds, little work has focused on understanding the implications of speculative execution for the performance and energy consumption of Hadoop clusters. In this paper, we design a set of experiments to evaluate the impact of speculative execution on the performance and energy consumption of Hadoop in homogeneous and heterogeneous environments. Our studies reveal that speculative execution may either reduce or increase the energy consumption of Hadoop clusters, depending on the reduction in the execution time of MapReduce applications and on the extra power consumption introduced by speculative execution. Moreover, we show that the extra power consumption varies across applications and is driven by three main factors: the duration of speculative tasks, the idle time, and the allocation of speculative tasks. To the best of our knowledge, our work provides the first in-depth look into the energy efficiency of speculative execution in Hadoop.
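
One rough way to read the trade-off stated in the abstract is the following sketch. It assumes, as a simplification not taken from the paper, that a job draws a constant average cluster power $P$ over its execution time $T$, so its energy is $E = P\,T$. The symbols $\Delta T$ (reduction in execution time due to speculation) and $\Delta P$ (extra average power due to speculative copies) are illustrative names introduced here.

$$
E_{\text{spec}} = (P + \Delta P)(T - \Delta T) < P\,T
\quad\Longleftrightarrow\quad
\frac{\Delta P}{P + \Delta P} < \frac{\Delta T}{T}
$$

Under these assumptions, speculative execution lowers energy consumption only when the relative reduction in execution time outweighs the relative increase in power; otherwise the job finishes sooner but consumes more energy overall.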

Cited literature : 32 references
Contributor : Shadi Ibrahim
Submitted on : Friday, December 4, 2015 - 1:28:39 PM
Last modification on : Thursday, December 19, 2019 - 1:33:52 AM
Document(s) archived on : Saturday, April 29, 2017 - 6:01:00 AM


Files produced by the author(s)


  • HAL Id : hal-01238055, version 1


Tien-Dat Phan, Shadi Ibrahim, Gabriel Antoniu, Luc Bougé. On Understanding the Energy Impact of Speculative Execution in Hadoop. GreenCom'15 - The 2015 IEEE International Conference on Green Computing and Communications, Dec 2015, Sydney, Australia. ⟨hal-01238055⟩


