Skip to Main content Skip to Navigation
Book sections

Programming models and runtimes

Abstract : Several millions of execution flows will be executed in ultrascale computing systems (UCS), and the task for the programmer to understand their coherency and for the runtime to coordinate them is unfathomable. Moreover, related to UCS large scale and their impact on reliability, the current static point of view is not more sufficient. A runtime cannot consider to restart an application because of the failure of a single node as statically several nodes will fail every day. Classical management of these failures by the programmers using checkpoint restart is also too limited due to the overhead at such a scale. The article explores programming models and runtimes required to facilitate the task of scaling and extracting performance on continuously evolving platforms, while providing resilience and fault-tolerant mechanisms to tackle the increasing probability of failures throughout the whole software stack.
Complete list of metadatas

https://hal.inria.fr/hal-02403121
Contributor : Emmanuel Jeannot <>
Submitted on : Tuesday, December 10, 2019 - 5:33:39 PM
Last modification on : Sunday, November 15, 2020 - 7:40:06 PM

Identifiers

Citation

Georges Da Costa, Alexey L. Lastovetsky, Jorge G. Barbosa, Juan Carlos Diaz Martin, Juan-Luis Garcia Zapata, et al.. Programming models and runtimes. Jesus Carretero; Emmanuel Jeannot; Albert Zomaya. Ultrascale Computing Systems, 2, Institution of Engineering and Technology, pp.9-63, 2019, 978-1785618338. ⟨10.1049/PBPC024E_ch2⟩. ⟨hal-02403121⟩

Share

Metrics

Record views

278