Skip to Main content Skip to Navigation
New interface
Conference papers

A multithreaded communication engine for multicore architectures

François Trahay 1, 2 Elisabeth Brunet 1, 2 Alexandre Denis 1, 2 Raymond Namyst 1, 2 
2 RUNTIME - Efficient runtime systems for parallel architectures
Inria Bordeaux - Sud-Ouest, UB - Université de Bordeaux, CNRS - Centre National de la Recherche Scientifique : UMR5800
Abstract : The current trend in clusters leads towards an increase of the number of cores per node. As a result, an increasing number of parallel applications is mixing message passing and multithreading as an attempt to better match the underlying architecture's structure. This naturally raises the problem of designing efficient, multithreaded implementations of MPI. In this paper, we present the design of a multithreaded communication engine able to exploit idle cores to speed up communications in two ways: it can move CPU-intensive operations out of the critical path (e.g. PIO transfers offload), and is able to let rendezvous transfers progress asynchronously. We have implemented these methods in the PM2 software suite, evaluated their behavior in typical cases, and we have observed good performance results in overlapping communication and computation.
Complete list of metadata

Cited literature [10 references]  Display  Hide  Download
Contributor : François Trahay Connect in order to contact the contributor
Submitted on : Wednesday, January 30, 2008 - 11:09:34 AM
Last modification on : Saturday, June 25, 2022 - 7:47:18 PM
Long-term archiving on: : Friday, April 30, 2010 - 9:52:06 PM


Files produced by the author(s)



François Trahay, Elisabeth Brunet, Alexandre Denis, Raymond Namyst. A multithreaded communication engine for multicore architectures. Communication Architecture for Clusters, Apr 2008, Miami, United States. ⟨10.1109/IPDPS.2008.4536139⟩. ⟨inria-00224999⟩



Record views


Files downloads