Skip to Main content Skip to Navigation
New interface
Conference papers

Improving MPI Applications Performance on Multicore Clusters with Rank Reordering

Guillaume Mercier 1, 2 Emmanuel Jeannot 1 
1 RUNTIME - Efficient runtime systems for parallel architectures
Inria Bordeaux - Sud-Ouest, UB - Université de Bordeaux, CNRS - Centre National de la Recherche Scientifique : UMR5800
Abstract : Modern hardware architectures featuring multicores and a complex memory hierarchy raise challenges that need to be addressed by parallel applications programmers. It is therefore tempting to adapt an application communication pattern to the characteristics of the underlying hardware. The MPI standard features several functions that allow the ranks of MPI processes to be reordered according to a graph attached to a newly created communicator. In this paper, we explain how the MPICH2 implementation of the MPI_Dist_graph_create function was modified to reorder the MPI process ranks to create a match between the application communication pattern and the hardware topology. The experimental results on a multicore cluster show that improvements can be achieved as long as the application communication pattern is expressed by a relevant metric.
Complete list of metadata

Cited literature [15 references]  Display  Hide  Download
Contributor : Guillaume Mercier Connect in order to contact the contributor
Submitted on : Monday, November 21, 2011 - 11:59:03 AM
Last modification on : Saturday, June 25, 2022 - 7:41:00 PM
Long-term archiving on: : Friday, November 16, 2012 - 11:32:11 AM


Files produced by the author(s)




Guillaume Mercier, Emmanuel Jeannot. Improving MPI Applications Performance on Multicore Clusters with Rank Reordering. EuroMPI, Sep 2011, Santorini, Italy. pp.39-49, ⟨10.1007/978-3-642-24449-0⟩. ⟨hal-00643151⟩



Record views


Files downloads