Fast and Faithful Performance Prediction of MPI Applications: the HPL Case Study

Tom Cornebize; Arnaud Legrand; Franz C Heinrich

doi:10.1109/CLUSTER.2019.8891011

Communication Dans Un Congrès Année : 2019

Fast and Faithful Performance Prediction of MPI Applications: the HPL Case Study

(1) , (2) , (3)

1
2
3

Tom Cornebize

Fonction : Auteur
PersonId : 16444
IdHAL : tom-cornebize
ORCID : 0000-0003-1439-2466
IdRef : 253120039

Université Grenoble Alpes [2016-2019]

Arnaud Legrand

Fonction : Auteur
PersonId : 11445
IdHAL : arnaud-legrand
ORCID : 0000-0002-8415-1046
IdRef : 069338191

Laboratoire d'Informatique de Grenoble

Franz C Heinrich

Fonction : Auteur

Performance analysis and optimization of LARge Infrastructures and Systems

Résumé

Finely tuning MPI applications (number of processes, granularity, collective operation algorithms, topology and process placement) is critical to obtain good performance on supercomputers. With a rising cost of modern supercomputers, running parallel applications at scale solely to optimize their performance is extremely expensive. Having inexpensive but faithful predictions of expected performance could be a great help for researchers and system administrators. The methodology we propose captures the complexity of adaptive applications by emulating the MPI code while skipping insignificant parts. We demonstrate its capability with High Performance Linpack (HPL), the benchmark used to rank supercomputers in the TOP500 and which requires a careful tuning. We explain (1) how we both extended the SimGrid's SMPI simulator and slightly modified the open-source version of HPL to allow a fast emulation on a single commodity server at the scale of a supercomputer and (2) how to model the different components (network, BLAS, ...) of the system. We show that a careful modeling of both spatial and temporal node variability allows us to obtain predictions within a few percents of real experiments (see Figure 1).

Domaines

Calcul parallèle, distribué et partagé [cs.DC]

Fichier principal

paper.pdf (996.47 Ko)

slides.pdf (2.83 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Format : Présentation
Commentaire : Slides used to present this work at Cluster'19 conference (Albuquerque, New Mexico, 25/09/2019)

Tom Cornebize : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-02096571

Soumis le : jeudi 26 septembre 2019-00:48:02

Dernière modification le : jeudi 4 avril 2024-21:38:47

Dates et versions

hal-02096571 , version 1 (11-04-2019)

hal-02096571 , version 2 (27-05-2019)

hal-02096571 , version 3 (27-05-2019)

hal-02096571 , version 4 (26-09-2019)

Identifiants

HAL Id : hal-02096571 , version 4
DOI : 10.1109/CLUSTER.2019.8891011

Citer

Tom Cornebize, Arnaud Legrand, Franz C Heinrich. Fast and Faithful Performance Prediction of MPI Applications: the HPL Case Study. 2019 IEEE International Conference on Cluster Computing (CLUSTER), Sep 2019, Albuquerque, United States. ⟨10.1109/CLUSTER.2019.8891011⟩. ⟨hal-02096571v4⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS INRIA LIG GRID5000 LIG_SRCPR INRIA2 LIG-SRCPR-POLARIS SILECS LIG_SIDCH

456 Consultations

1501 Téléchargements

Fast and Faithful Performance Prediction of MPI Applications: the HPL Case Study

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager