Toward Better Simulation of MPI Applications on Ethernet/TCP Networks

Abstract : Simulation and modeling for performance prediction and profiling is essential for developing and maintaining HPC code that is expected to scale for next-generation exascale systems, and correctly modeling network behavior is essential for creating realistic simulations. In this article we describe an implementation of a flow-based hybrid network model that accounts for factors such as network topology and contention, which are commonly ignored by other approaches. We focus on large-scale, Ethernet-connected systems, as these currently compose 37.8% of the TOP500 index, and this share is expected to increase as higher-speed 10 and 100GbE become more available. The European Mont-Blanc project to study exascale computing by developing prototype systems with low-power embedded devices will also use Ethernet-based interconnect. Our model is implemented within SMPI, an open-source MPI implementation that connects real applications to the SimGrid simulation framework. SMPI provides implementations of collective communications based on current versions of both OpenMPI and MPICH. SMPI and SimGrid also provide methods for easing the simulation of large-scale systems, including shadow execution, memory folding, and support for both online and offline (i.e., post-mortem) simulation. We validate our proposed model by comparing traces produced by SMPI with those from real world experiments, as well as with those obtained using other established network models. Our study shows that SMPI has a consistently better predictive power than classical LogP-based models for a wide range of scenarios including both established HPC benchmarks and real applications.
Type de document :
Communication dans un congrès
PMBS13 - 4th International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems, Nov 2013, Denver, United States. 2013
Liste complète des métadonnées

Littérature citée [41 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-00919507
Contributeur : Arnaud Legrand <>
Soumis le : lundi 16 décembre 2013 - 21:35:44
Dernière modification le : mercredi 6 septembre 2017 - 11:25:52
Document(s) archivé(s) le : samedi 8 avril 2017 - 07:19:26

Fichiers

smpi_pmbs13.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00919507, version 1

Citation

Paul Bedaride, Augustin Degomme, Stéphane Genaud, Arnaud Legrand, George Markomanolis, et al.. Toward Better Simulation of MPI Applications on Ethernet/TCP Networks. PMBS13 - 4th International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems, Nov 2013, Denver, United States. 2013. 〈hal-00919507〉

Partager

Métriques

Consultations de
la notice

1107

Téléchargements du document

736