Conference papers

Application-Driven Requirements for Node Resource Management in Next-Generation Systems

Abstract : Emerging workloads on supercomputing platforms are pushing the limits of traditional high-performance computing software environments. Multi-physics, coupled simulations, big data processing and machine learning frameworks, and multi-component workloads pose serious challenges to system and application developers. At the heart of the problem is the lack of cross-stack coordination to enable flexible resource management among multiple runtime components. In this work we analyze seven real-world applications that represent emerging workloads and illustrate the scope and magnitude of the problem. We then extract several themes from these applications that highlight next-generation requirements for node resource managers. Finally, using these requirements, we propose a general, cross-stack coordination framework and outline its components and functionality.

Cited literature [40 references]
Contributor : Brice Goglin
Submitted on : Wednesday, October 14, 2020 - 11:03:11 AM
Last modification on : Tuesday, July 5, 2022 - 8:39:08 AM


Files produced by the author(s)


  • HAL Id : hal-02950635, version 2



Edgar A León, Balazs Gerofi, Julien Jaeger, Guillaume Mercier, Rolf Riesen, et al. Application-Driven Requirements for Node Resource Management in Next-Generation Systems. ROSS 2020 : International Workshop on Runtime and Operating Systems for Supercomputers, Nov 2020, Atlanta, GA / Virtual, United States. ⟨hal-02950635v2⟩


