Voltage Overscaling Algorithms for Energy-Efficient Workflow Computations With Timing Errors

Abstract: We propose a software-based approach that uses dynamic voltage overscaling to reduce the energy consumption of HPC applications. This technique aggressively lowers the supply voltage below the nominal voltage, which introduces timing errors; we use Algorithm-Based Fault Tolerance (ABFT) to provide fault tolerance for matrix operations. We introduce a formal model and design optimal polynomial-time solutions for executing a linear chain of tasks. Evaluation results for matrix multiplication demonstrate that our approach leads to significant energy savings compared to the standard algorithm, which always operates at nominal voltage.
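
To illustrate how ABFT can catch a timing error in a matrix product, here is a minimal sketch of the classic checksum scheme for matrix multiplication. It is not taken from the paper and may differ from the authors' implementation; all function names are hypothetical, and the tolerance `tol` is an assumed parameter. The idea is to extend A with a row of column sums and B with a column of row sums, so that the product carries checksums that locate and repair a single corrupted entry.

```python
def matmul(a, b):
    """Plain matrix product of two lists of lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def with_column_checksum(a):
    """Append a row holding the sum of each column of a."""
    return [row[:] for row in a] + [[sum(col) for col in zip(*a)]]


def with_row_checksum(b):
    """Append to each row of b the sum of that row."""
    return [row[:] + [sum(row)] for row in b]


def find_mismatches(c_full, tol=1e-9):
    """Return (bad_rows, bad_cols): data rows/columns that disagree with their checksum."""
    n, p = len(c_full) - 1, len(c_full[0]) - 1
    bad_rows = [i for i in range(n)
                if abs(sum(c_full[i][:p]) - c_full[i][p]) > tol]
    bad_cols = [j for j in range(p)
                if abs(sum(c_full[i][j] for i in range(n)) - c_full[n][j]) > tol]
    return bad_rows, bad_cols


def correct_single_error(c_full, tol=1e-9):
    """Repair one corrupted data entry in place; return True if the result is clean."""
    bad_rows, bad_cols = find_mismatches(c_full, tol)
    if not bad_rows and not bad_cols:
        return True  # nothing to fix
    if len(bad_rows) != 1 or len(bad_cols) != 1:
        return False  # not a single data-entry error: caller must recompute
    i, j = bad_rows[0], bad_cols[0]
    p = len(c_full[0]) - 1
    # Rebuild the faulty entry from the row checksum and the other row entries.
    c_full[i][j] = c_full[i][p] - sum(c_full[i][k] for k in range(p) if k != j)
    return True


# Example: compute the checksummed product, inject one error, then repair it.
a = [[1.0, 2.0], [3.0, 4.0]]
b = [[5.0, 6.0], [7.0, 8.0]]
c_full = matmul(with_column_checksum(a), with_row_checksum(b))
c_full[0][1] += 10.0  # simulated timing error in one entry
repaired = correct_single_error(c_full)
```

In a voltage-overscaling setting, a check of this kind would run after each task executed below nominal voltage; when it returns False, the task would have to be re-executed, which is the trade-off between energy savings and error handling that the abstract describes.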
Document type:
Conference paper
FTXS '15: 5th Workshop on Fault Tolerance for HPC at eXtreme Scale, Jun 2015, Portland, United States. ACM, FTXS '15 Proceedings of the 5th Workshop on Fault Tolerance for HPC at eXtreme Scale, pp.8, 2015, 〈10.1145/2751504.2751508〉
Cited literature [9 references]

https://hal.inria.fr/hal-01199250
Contributor: Equipe Roma
Submitted on: Friday, September 25, 2015 - 11:35:54
Last modified on: Friday, April 20, 2018 - 15:44:27
Document(s) archived on: Tuesday, December 29, 2015 - 07:03:25

File

FTXS4HAL.pdf
Files produced by the author(s)

Citation

Aurélien Cavelan, Yves Robert, Hongyang Sun, Frédéric Vivien. Voltage Overscaling Algorithms for Energy-Efficient Workflow Computations With Timing Errors. FTXS '15: 5th Workshop on Fault Tolerance for HPC at eXtreme Scale, Jun 2015, Portland, United States. ACM, FTXS '15 Proceedings of the 5th Workshop on Fault Tolerance for HPC at eXtreme Scale, pp.8, 2015, 〈10.1145/2751504.2751508〉. 〈hal-01199250〉
