Application-Driven Requirements for Node Resource Management in Next-Generation Systems - Archive ouverte HAL Access content directly
Conference Papers Year :

Application-Driven Requirements for Node Resource Management in Next-Generation Systems

(1) , (2) , (3) , (4) , (5) , (2) , (6)
1
2
3
4
5
6

Abstract

Emerging workloads on supercomputing platforms are pushing the limits of traditional high-performance computing software environments. Multi-physics, coupled simulations, big data processing and machine learning frameworks, and multi-component workloads pose serious challenges to system and application developers. At the heart of the problem is the lack of cross-stack coordination to enable flexible resource management among multiple runtime components. In this work we analyze seven, real-world applications that represent emerging workloads and illustrate the scope and magnitude of the problem. We then extract several themes from these applications that highlight next-generation requirements for node resource managers. Finally, using these requirements, we propose a general, cross-stack coordination framework and outline its components and functionality.
Fichier principal
Vignette du fichier
main.pdf (516.28 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-02950635 , version 1 (28-09-2020)
hal-02950635 , version 2 (14-10-2020)

Identifiers

  • HAL Id : hal-02950635 , version 2

Cite

Edgar A León, Balazs Gerofi, Julien Jaeger, Guillaume Mercier, Rolf Riesen, et al.. Application-Driven Requirements for Node Resource Management in Next-Generation Systems. ROSS 2020 : International Workshop on Runtime and Operating Systems for Supercomputers, Nov 2020, Atlanta, GA / Virtual, United States. ⟨hal-02950635v2⟩
203 View
206 Download

Share

Gmail Facebook Twitter LinkedIn More