Toward More Scalable Off-Line Simulations of MPI Applications

Abstract : The off-line (or post-mortem) analysis of execution event traces is a popular approach to understand the performance of HPC applications that use the message passing paradigm. Combining this analysis with simulation makes it possible to " replay " the application execution to explore " what if? " scenarios, e.g., assessing application performance in a range of (hypothetical) execution environments. However, such off-line analysis faces scalability issues for acquiring, storing, or replaying large event traces. We first present two previously proposed and complementary frameworks for off-line replaying of MPI application event traces, each with its own objectives and limitations. We then describe how these frameworks can be combined so as to capitalize on their respective strengths while alleviating several of their limitations. We claim that the combined framework affords levels of scalability that are beyond that achievable by either one of the two individual frameworks. We evaluate this framework to illustrate the benefits of the proposed combination for a more scalable off-line analysis of MPI applications.
Type de document :
Article dans une revue
Parallel Processing Letters, World Scientific Publishing, 2015, 25 (3), <10.1142/S0129626415410029>
Liste complète des métadonnées


https://hal.inria.fr/hal-01232787
Contributeur : Frédéric Suter <>
Soumis le : mardi 24 novembre 2015 - 11:08:12
Dernière modification le : mardi 10 mai 2016 - 08:46:19
Document(s) archivé(s) le : vendredi 28 avril 2017 - 17:02:16

Fichier

scalatrace-ti_hal.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Henri Casanova, Anshul Gupta, Frédéric Suter. Toward More Scalable Off-Line Simulations of MPI Applications. Parallel Processing Letters, World Scientific Publishing, 2015, 25 (3), <10.1142/S0129626415410029>. <hal-01232787>

Partager

Métriques

Consultations de
la notice

144

Téléchargements du document

101