Skip to Main content Skip to Navigation
Conference papers

Application-Driven Requirements for Node Resource Management in Next-Generation Systems

Abstract : Emerging workloads on supercomputing platforms are pushing the limits of traditional high-performance computing software environments. Multi-physics, coupled simulations, big data processing and machine learning frameworks, and multi-component workloads pose serious challenges to system and application developers. At the heart of the problem is the lack of cross-stack coordination to enable flexible resource management among multiple runtime components. In this work we analyze seven, real-world applications that represent emerging workloads and illustrate the scope and magnitude of the problem. We then extract several themes from these applications that highlight next-generation requirements for node resource managers. Finally, using these requirements, we propose a general, cross-stack coordination framework and outline its components and functionality.
Complete list of metadatas

Cited literature [40 references]  Display  Hide  Download

https://hal.inria.fr/hal-02950635
Contributor : Brice Goglin <>
Submitted on : Wednesday, October 14, 2020 - 11:03:11 AM
Last modification on : Thursday, October 15, 2020 - 4:10:19 AM

File

main.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02950635, version 2

Collections

Citation

Edgar León, Balazs Gerofi, Julien Jaeger, Guillaume Mercier, Rolf Riesen, et al.. Application-Driven Requirements for Node Resource Management in Next-Generation Systems. ROSS 2020 : International Workshop on Runtime and Operating Systems for Supercomputers, Nov 2020, Atlanta, GA / Virtual, United States. ⟨hal-02950635v2⟩

Share

Metrics

Record views

14

Files downloads

18