On Understanding the Energy Impact of Speculative Execution in Hadoop

Tien-Dat Phan 1 Shadi Ibrahim 1 Gabriel Antoniu 1 Luc Bougé 1
1 KerData - Scalable Storage for Clouds and Beyond
Inria Rennes – Bretagne Atlantique , IRISA-D1 - SYSTÈMES LARGE ÉCHELLE
Abstract : Hadoop emerged as an important system for large- scale data analysis. Speculative execution is a key feature in Hadoop that is extensively leveraged in clouds: it is used to mask slow tasks (i.e., stragglers) — resulted from resource contention and heterogeneity in clouds — by launching speculative task copies on other machines. However, speculative execution is not cost-free and may result in performance degradation and extra resource and energy consumption. While prior literature has been dedicated to improving stragglers detection to cope with the inevitable heterogeneity in clouds, little work is focusing on understanding the implications of speculative execution on the performance and energy consumption in Hadoop cluster. In this paper, we have designed a set of experiments to evaluate the impact of speculative execution on the performance and energy consumption of Hadoop in homo- and heterogeneous environments. Our studies reveal that speculative execution may sometimes reduce, sometimes increase the energy consumption of Hadoop clusters. This strongly depends on the reduction in the execution time of MapReduce applications and on the extra power consumption introduced by speculative execution. Moreover, we show that the extra power consumption varies in-between applications and is contributed to by three main factors: the duration of speculative tasks, the idle time, and the allocation of speculative tasks. To the best of our knowledge, our work provides the first deep look into the energy efficiency of speculative execution in Hadoop.
Complete list of metadatas

Cited literature [32 references]  Display  Hide  Download

https://hal.inria.fr/hal-01238055
Contributor : Shadi Ibrahim <>
Submitted on : Friday, December 4, 2015 - 1:28:39 PM
Last modification on : Thursday, February 7, 2019 - 3:25:58 PM
Long-term archiving on : Saturday, April 29, 2017 - 6:01:00 AM

File

GreenCom-2015.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01238055, version 1

Citation

Tien-Dat Phan, Shadi Ibrahim, Gabriel Antoniu, Luc Bougé. On Understanding the Energy Impact of Speculative Execution in Hadoop. GreenCom'15-The 2015 IEEE International Conference on Green Computing and Communications , Dec 2015, Sydney, Australia. ⟨hal-01238055⟩

Share

Metrics

Record views

1053

Files downloads

615