Static/Dynamic Analyses for Validation and Improvements of Multi-Model HPC Applications.

Abstract : Supercomputing plays an important role in several innovative fields, speeding up prototyping or validating scientific theories. However, supercomputers are evolving rapidly with now millions of processing units, posing the questions of their programmability. Despite the emergence of more widespread and functional parallel programming models, developing correct and effective parallel applications still remains a complex task. Although debugging solutions have emerged to address this issue, they often come with restrictions. However programming model evolutions stress the requirement for a convenient validation tool able to handle hybrid applications. Indeed as current scientific applications mainly rely on the Message Passing Interface (MPI) parallel programming model, new hardwares designed for Exascale with higher node-level parallelism clearly advocate for an MPI+X solutions with X a thread-based model such as OpenMP. But integrating two different programming models inside the same application can be error-prone leading to complex bugs - mostly detected unfortunately at runtime. In an MPI+X program not only the correctness of MPI should be ensured but also its interactions with the multi-threaded model, for example identical MPI collective operations cannot be performed by multiple nonsynchronized threads. This thesis aims at developing a combination of static and dynamic analysis to enable an early verification of hybrid HPC applications. The first pass statically verifies the thread level required by an MPI+OpenMP application and outlines execution paths leading to potential deadlocks. Thanks to this analysis, the code is selectively instrumented, displaying an error and synchronously interrupting all processes if the actual scheduling leads to a deadlock situation.
Complete list of metadatas

Cited literature [96 references]  Display  Hide  Download

https://tel.archives-ouvertes.fr/tel-01228072
Contributor : Abes Star <>
Submitted on : Thursday, November 12, 2015 - 3:42:52 PM
Last modification on : Tuesday, May 14, 2019 - 11:38:11 AM
Long-term archiving on : Friday, April 28, 2017 - 4:47:22 AM

File

SAILLARD_EMMANUELLE_2015.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-01228072, version 1

Collections

Citation

Emmanuelle Saillard. Static/Dynamic Analyses for Validation and Improvements of Multi-Model HPC Applications.. Distributed, Parallel, and Cluster Computing [cs.DC]. Université de Bordeaux, 2015. English. ⟨NNT : 2015BORD0176⟩. ⟨tel-01228072⟩

Share

Metrics

Record views

425

Files downloads

651