Conference papers

Sustaining Performance While Reducing Energy Consumption: A Control Theory Approach

Sophie Cerf 1 Raphaël Bleuse 1 Valentin Reis 2 Swann Perarnau 2 Eric Rutten 1 
1 CTRL-A - Control for Autonomic computing systems
Inria Grenoble - Rhône-Alpes, LIG - Laboratoire d'Informatique de Grenoble
Abstract: Production high-performance computing systems continue to grow in complexity and size. As applications struggle to make use of increasingly heterogeneous compute nodes, maintaining high efficiency (performance per watt) for the whole platform becomes a challenge. Alongside the growing complexity of scientific workloads, this extreme heterogeneity is also an opportunity: as applications dynamically undergo variations in workload, due to phases or data/compute movement between devices, one can dynamically adjust power across compute elements to save energy without impacting performance. With an aim toward an autonomous and dynamic power management strategy for current and future HPC architectures, this paper explores the use of control theory for the design of a dynamic power regulation method. Structured as a feedback loop, our approach, which is novel in computing resource management, consists of periodically monitoring application progress and choosing at runtime a suitable power cap for processors. Thanks to a preliminary offline identification process, we derive a model of the dynamics of the system and a proportional-integral (PI) controller. We evaluate our approach on top of an existing resource management framework, the Argo Node Resource Manager, deployed on several clusters of Grid'5000, using a standard memory-bound HPC benchmark.
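The feedback loop described in the abstract (measure progress periodically, then pick a power cap) can be illustrated with a minimal sketch of a discrete-time PI controller. Everything below is an assumption made for illustration and is not taken from the paper: the gains `kp` and `ki`, the cap bounds, the setpoint, and especially `toy_progress`, a made-up saturating model standing in for a memory-bound application. The sketch does not use the Argo Node Resource Manager API or any real power-capping interface.

```python
from dataclasses import dataclass


@dataclass
class PIController:
    """Incremental (velocity-form) PI controller that outputs a power cap in watts."""
    kp: float          # proportional gain (hypothetical value)
    ki: float          # integral gain (hypothetical value)
    setpoint: float    # target application progress (arbitrary units per second)
    cap_min: float     # lowest power cap allowed (hypothetical bound)
    cap_max: float     # highest power cap allowed (hypothetical bound)
    cap: float         # current power cap, updated at every step
    prev_error: float = 0.0

    def step(self, progress: float, dt: float) -> float:
        """Update the power cap from one progress measurement."""
        error = self.setpoint - progress
        # Velocity form: the integral action is carried by the cap itself,
        # so clamping the cap also bounds the accumulated integral term.
        delta = self.kp * (error - self.prev_error) + self.ki * error * dt
        self.prev_error = error
        self.cap = min(self.cap_max, max(self.cap_min, self.cap + delta))
        return self.cap


def toy_progress(cap: float) -> float:
    """Made-up plant: progress grows with power, then saturates at 25 units."""
    return min(25.0, 0.25 * cap)


def simulate(steps: int, dt: float = 1.0) -> tuple[float, float]:
    """Run the closed loop and return the final (cap, progress) pair."""
    controller = PIController(
        kp=1.0, ki=2.0,
        setpoint=22.5,            # 90% of the toy model's maximum progress
        cap_min=40.0, cap_max=120.0,
        cap=120.0,                # start uncapped, at the maximum
    )
    cap = controller.cap
    for _ in range(steps):
        progress = toy_progress(cap)      # "monitor application progress"
        cap = controller.step(progress, dt)  # "choose a suitable power cap"
    return cap, toy_progress(cap)


if __name__ == "__main__":
    final_cap, final_progress = simulate(50)
    print(f"cap={final_cap:.2f} W, progress={final_progress:.2f}")
```

In this toy setting the controller lowers the cap from its maximum until progress settles at the setpoint. Real gains would come from an identified model of the system, as the abstract indicates, rather than from the hand-picked values used here.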

https://hal.inria.fr/hal-03259316
Contributor: Raphaël Bleuse
Submitted on: Monday, July 5, 2021 - 5:55:06 PM
Last modification on: Tuesday, June 14, 2022 - 12:20:39 PM
Long-term archiving on: Wednesday, October 6, 2021 - 6:01:11 PM

Files

main.pdf
Files produced by the author(s)

Citation

Sophie Cerf, Raphaël Bleuse, Valentin Reis, Swann Perarnau, Eric Rutten. Sustaining Performance While Reducing Energy Consumption: A Control Theory Approach. EURO-PAR 2021 - 27th International European Conference on Parallel and Distributed Computing, Aug 2021, Lisbon, Portugal. pp.334-349, ⟨10.1007/978-3-030-85665-6_21⟩. ⟨hal-03259316⟩

Metrics

Record views: 136
File downloads: 168