Book Sections, Year: 2019

Programming models and runtimes

Abstract

Ultrascale computing systems (UCS) will execute several million execution flows, making it extremely difficult for programmers to understand their coherency and for the runtime to coordinate them. Moreover, given the scale of UCS and its impact on reliability, the current static point of view is no longer sufficient. A runtime cannot restart an application because of the failure of a single node when, statistically, several nodes will fail every day. The classical approach, in which programmers handle these failures with checkpoint-restart, is also too limited because of its overhead at such a scale. The article explores the programming models and runtimes required to facilitate scaling and extracting performance on continuously evolving platforms, while providing resilience and fault-tolerance mechanisms to tackle the increasing probability of failures throughout the whole software stack.

Dates and versions

hal-02403121, version 1 (10-12-2019)

Cite

Georges Da Costa, Alexey L. Lastovetsky, Jorge G. Barbosa, Juan Carlos Diaz Martin, Juan-Luis Garcia Zapata, et al. Programming models and runtimes. Jesus Carretero; Emmanuel Jeannot; Albert Zomaya. Ultrascale Computing Systems, 2, Institution of Engineering and Technology, pp.9-63, 2019, 978-1785618338. ⟨10.1049/PBPC024E_ch2⟩. ⟨hal-02403121⟩