Skip to Main content Skip to Navigation
Conference papers

A Runtime Framework for Energy Efficient HPC Systems Without a Priori Knowledge of Applications

Abstract : The rising computing demands of scientific endeavours often require the creation and management of High Performance Computing (HPC) systems for running experiments and processing vast amounts of data. These HPC systems generally operate at peak performance, consuming a large quantity of electricity, even though their workload varies over time. Understanding the behavioural patterns i.e., phases) of HPC systems during their use is key to adjust performance to resource demand and hence improve the energy efficiency. In this paper, we describe (i) a method to detect phases of an HPC system based on its workload, and (ii) a partial phase recognition technique that works cooperatively with on-the-fly dynamic management. We implement a prototype that guides the use of energy saving capabilities to demonstrate the benefits of our approach. Experimental results reveal the effectiveness of the phase detection method under real-life workload and benchmarks. A comparison with baseline unmanaged execution shows that the partial phase recognition technique saves up to 15% of energy with less than 1% performance degradation.
Complete list of metadata

Cited literature [13 references]  Display  Hide  Download
Contributor : Ghislain Landry Tsafack Chetsa Connect in order to contact the contributor
Submitted on : Friday, February 22, 2013 - 5:49:27 PM
Last modification on : Thursday, January 20, 2022 - 4:14:38 PM
Long-term archiving on: : Sunday, April 2, 2017 - 4:33:16 AM


Files produced by the author(s)


  • HAL Id : hal-00793685, version 1


Ghislain Landry Tsafack Chetsa, Laurent Lefèvre, Jean-Marc Pierson, Patricia Stolf, Georges da Costa. A Runtime Framework for Energy Efficient HPC Systems Without a Priori Knowledge of Applications. ICPAD 2012 : 18th International Conference on Parallel and Distributed Systems, Nov 2012, Singapour, Singapore. pp.660-667. ⟨hal-00793685⟩



Les métriques sont temporairement indisponibles