pioman: a Generic Framework for Asynchronous Progression and Multithreaded Communications

Alexandre Denis 1, 2
2 RUNTIME - Efficient runtime systems for parallel architectures
Inria Bordeaux - Sud-Ouest, UB - Université de Bordeaux, CNRS - Centre National de la Recherche Scientifique : UMR5800
Abstract : Recent cluster architectures include dozens of cores per node, with all cores sharing the network resources. To program such architectures, hybrid models mixing MPI+threads, and in particular MPI+OpenMP are gaining popularity. This imposes new requirements on communication libraries, such as the need for MPI_THREAD_MULTIPLE level of multi-threading support. Moreover, the high number of cores brings new opportunities to parallelize communication libraries, so as to have proper background progression of communication and communication/computation overlap. In this paper, we present pioman, a generic framework to be used by MPI implementations, that brings seamless asynchronous progression of communication by opportunistically using available cores. It uses system threads and thus is composable with any runtime system used for multithreading. Through various benchmarks, we demonstrate that our \texttt{pioman}-based MPI implementation exhibits very good properties regarding overlap, progression, and multithreading, and outperforms state-of-art MPI implementations.
Keywords : MPI pioman NewMadeleine
Type de document :
Communication dans un congrès
IEEE International Conference on Cluster Computing (IEEE Cluster), Sep 2014, Madrid, Spain. 2014
Liste complète des métadonnées

https://hal.inria.fr/hal-01064652
Contributeur : Alexandre Denis <>
Soumis le : mardi 16 septembre 2014 - 17:19:51
Dernière modification le : mercredi 29 novembre 2017 - 15:10:40
Document(s) archivé(s) le : mercredi 17 décembre 2014 - 11:46:03

Identifiants

  • HAL Id : hal-01064652, version 1

Collections

Citation

Alexandre Denis. pioman: a Generic Framework for Asynchronous Progression and Multithreaded Communications. IEEE International Conference on Cluster Computing (IEEE Cluster), Sep 2014, Madrid, Spain. 2014. 〈hal-01064652〉

Partager

Métriques

Consultations de la notice

232

Téléchargements de fichiers

206