A Runtime Framework for Energy Efficient HPC Systems Without a Priori Knowledge of Applications

Abstract : The rising computing demands of scientific endeavours often require the creation and management of High Performance Computing (HPC) systems for running experiments and processing vast amounts of data. These HPC systems generally operate at peak performance, consuming a large quantity of electricity, even though their workload varies over time. Understanding the behavioural patterns i.e., phases) of HPC systems during their use is key to adjust performance to resource demand and hence improve the energy efficiency. In this paper, we describe (i) a method to detect phases of an HPC system based on its workload, and (ii) a partial phase recognition technique that works cooperatively with on-the-fly dynamic management. We implement a prototype that guides the use of energy saving capabilities to demonstrate the benefits of our approach. Experimental results reveal the effectiveness of the phase detection method under real-life workload and benchmarks. A comparison with baseline unmanaged execution shows that the partial phase recognition technique saves up to 15% of energy with less than 1% performance degradation.
Type de document :
Communication dans un congrès
ICPAD 2012 : 18th International Conference on Parallel and Distributed Systems, Nov 2012, Singapour, Singapore. IEEE, pp.660-667, 2012
Liste complète des métadonnées

Littérature citée [13 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-00793685
Contributeur : Ghislain Landry Tsafack Chetsa <>
Soumis le : vendredi 22 février 2013 - 17:49:27
Dernière modification le : mercredi 23 mai 2018 - 17:58:06
Document(s) archivé(s) le : dimanche 2 avril 2017 - 04:33:16

Fichier

icpads-fv.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00793685, version 1

Citation

Ghislain Landry Tsafack Chetsa, Laurent Lefèvre, Jean-Marc Pierson, Patricia Stolf, Georges Da Costa. A Runtime Framework for Energy Efficient HPC Systems Without a Priori Knowledge of Applications. ICPAD 2012 : 18th International Conference on Parallel and Distributed Systems, Nov 2012, Singapour, Singapore. IEEE, pp.660-667, 2012. 〈hal-00793685〉

Partager

Métriques

Consultations de la notice

511

Téléchargements de fichiers

644