Skip to Main content Skip to Navigation
Conference papers

Improving Power Efficiency Through Fine-Grain Performance Monitoring in HPC Clusters

Mathieu Stoffel 1 Abdelhafid Mazouz 2
1 CORSE [2016-2019] - Compiler Optimization and Run-time Systems [2016-2019]
Inria Grenoble - Rhône-Alpes, LIG - Laboratoire d'Informatique de Grenoble
Abstract : Nowadays, power and energy consumption are of paramount importance. Further, reaching the Exascale target will not be possible in the short term without major breakthroughs in software and hardware technologies to meet power consumption constraints. In this context, this papers discusses the design and implementation of a system-wide tool to monitor, analyze and control power/energy consumption in HPC clusters. We developed a lightweight tool that relies on a fine-grain sampling of two CPU performance metrics: instructions throughput (IPC) and last level cache bandwidth. Thanks to the information provided by these metrics about hardware resources' activity, and using DVFS to control power/performance, we show that it is possible to achieve up to 16% energy savings at the cost of less than 3% performance degradation on real HPC applications.
Complete list of metadatas

https://hal.inria.fr/hal-02102454
Contributor : Mathieu Stoffel <>
Submitted on : Wednesday, April 17, 2019 - 12:31:28 PM
Last modification on : Tuesday, October 6, 2020 - 4:20:09 PM

Identifiers

Collections

Citation

Mathieu Stoffel, Abdelhafid Mazouz. Improving Power Efficiency Through Fine-Grain Performance Monitoring in HPC Clusters. CLUSTER 2018 - IEEE International Conference on Cluster Computing, Sep 2018, Belfast, United Kingdom. pp.552-561, ⟨10.1109/CLUSTER.2018.00071⟩. ⟨hal-02102454⟩

Share

Metrics

Record views

90