NIC-assisted cache-efficient receive stack for message passing over Ethernet

Abstract : High-speed networking in clusters usually relies on advanced hardware features in the NICs, such as zero-copy capability. Open-MX is a high-performance message passing stack tailored for regular Ethernet hardware without such capabilities. We present the addition of a multiqueue support in the Open-MX receive stack so that all incoming packets for the same process are handled on the same core. We then introduce the idea of binding the target end process near its dedicated receive queue. This model leads to a more cache-efficient receive stack for Open-MX. It also proves that very simple and stateless hardware features may have a significant impact on message passing performance over Ethernet. The implementation of this model in a firmware reveals that it may not be as efficient as some manually tuned micro-benchmarks. But our multiqueue receive stack generally performs better than the original single queue stack, especially on large communication patterns where multiple processes are involved and manual binding is difficult.
Document type :
Journal articles
Concurrency and Computation: Practice and Experience, John Wiley and Sons, 2011, Special Issue: Euro-Par 2009, 23 (2), pp.199-210. <10.1002/cpe.1632>


https://hal.inria.fr/inria-00496301
Contributor : Brice Goglin <>
Submitted on : Wednesday, June 30, 2010 - 2:37:22 PM
Last modification on : Monday, May 6, 2013 - 9:41:48 PM

File

article.pdf
fileSource_public_author

Identifiers

Citation

Brice Goglin. NIC-assisted cache-efficient receive stack for message passing over Ethernet. Concurrency and Computation: Practice and Experience, John Wiley and Sons, 2011, Special Issue: Euro-Par 2009, 23 (2), pp.199-210. <10.1002/cpe.1632>. <inria-00496301>

Export

Share

Metrics

Consultation de
la notice

273

Téléchargement du document

132