Improving Power Efficiency Through Fine-Grain Performance Monitoring in HPC Clusters - Archive ouverte HAL Access content directly
Conference Papers Year :

Improving Power Efficiency Through Fine-Grain Performance Monitoring in HPC Clusters

(1) , (2)
1
2
Mathieu Stoffel
  • Function : Author
  • PersonId : 1045764
Abdelhafid Mazouz
  • Function : Author
  • PersonId : 878191

Abstract

Nowadays, power and energy consumption are of paramount importance. Further, reaching the Exascale target will not be possible in the short term without major breakthroughs in software and hardware technologies to meet power consumption constraints. In this context, this papers discusses the design and implementation of a system-wide tool to monitor, analyze and control power/energy consumption in HPC clusters. We developed a lightweight tool that relies on a fine-grain sampling of two CPU performance metrics: instructions throughput (IPC) and last level cache bandwidth. Thanks to the information provided by these metrics about hardware resources' activity, and using DVFS to control power/performance, we show that it is possible to achieve up to 16% energy savings at the cost of less than 3% performance degradation on real HPC applications.
Not file

Dates and versions

hal-02102454 , version 1 (17-04-2019)

Identifiers

Cite

Mathieu Stoffel, Abdelhafid Mazouz. Improving Power Efficiency Through Fine-Grain Performance Monitoring in HPC Clusters. CLUSTER 2018 - IEEE International Conference on Cluster Computing, Sep 2018, Belfast, United Kingdom. pp.552-561, ⟨10.1109/CLUSTER.2018.00071⟩. ⟨hal-02102454⟩
45 View
0 Download

Altmetric

Share

Gmail Facebook Twitter LinkedIn More