Skip to Main content Skip to Navigation
Conference papers

Finding a Tradeoff between Host Interrupt Load and MPI Latency over Ethernet

Brice Goglin 1, 2 Nathalie Furmento 1, 2
2 RUNTIME - Efficient runtime systems for parallel architectures
Inria Bordeaux - Sud-Ouest, UB - Université de Bordeaux, CNRS - Centre National de la Recherche Scientifique : UMR5800
Abstract : Achieving high-performance message passing on top of generic Ethernet hardware suffers from the NIC interrupt-driven model where coalescing is usually involved. We present an in-depth study of the impact of interrupt coalescing on the Open-MX performance. It shows that disabling coalescing may not be relevant for most metrics except small-message latency. Two new coalescing strategies are then presented so as to efficiently support both latency-friendly and coalescing-friendly workloads thanks to the NIC looking at Open-MX messages and streams before deciding when to raise interrupts. The implementation of these strategies in the firmware of Myri-10G NICs shows that Open-MX is now able to achieve a low small-message latency, a high large-message throughput, and a satisfying message rate without having to manually tune the coalescing delay depending on the benchmark. Real application performance evaluation further shows that our modifications even improve the NAS Parallel Benchmark IS execution time by 7-8% thanks to our NIC firmware raising up to 20% of additional interrupts at the correct time.
Document type :
Conference papers
Complete list of metadata

Cited literature [12 references]  Display  Hide  Download
Contributor : Brice Goglin Connect in order to contact the contributor
Submitted on : Saturday, June 20, 2009 - 6:52:21 PM
Last modification on : Monday, December 20, 2021 - 4:50:11 PM
Long-term archiving on: : Monday, October 15, 2012 - 2:40:43 PM


Files produced by the author(s)



Brice Goglin, Nathalie Furmento. Finding a Tradeoff between Host Interrupt Load and MPI Latency over Ethernet. Cluster 2009, Sep 2009, New Orleans, United States. ⟨10.1109/CLUSTR.2009.5289165⟩. ⟨inria-00397328⟩



Les métriques sont temporairement indisponibles