Skip to Main content Skip to Navigation
Conference papers

Improving MPI Application Communication Time with an Introspection Monitoring Library

Abstract : In this paper we describe how to improve communication time of MPI parallel applications with the use of a library that enables to monitor MPI applications and allows for introspection (the program itself can query the state of the monitoring system). Based on previous work, this library is able to see how collective communications are decomposed into point-to-point messages. It also features monitoring sessions that allow suspending and restarting the monitoring, limiting it to specific portions of the code. Experiments show that the monitoring overhead is very small and that the proposed features allow for dynamic and efficient rank reordering enabling up to 2-time reduction of communication parts of some program.
Complete list of metadata

Cited literature [21 references]  Display  Hide  Download

https://hal.inria.fr/hal-02906352
Contributor : Emmanuel Jeannot <>
Submitted on : Friday, July 24, 2020 - 3:45:47 PM
Last modification on : Monday, August 3, 2020 - 4:17:20 PM
Long-term archiving on: : Tuesday, December 1, 2020 - 6:47:49 AM

File

PDSEC-02.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02906352, version 1

Collections

Citation

Emmanuel Jeannot, Richard Sartori. Improving MPI Application Communication Time with an Introspection Monitoring Library. PDSEC 2020 - 21st IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing, May 2020, New-Orleans, United States. pp.10. ⟨hal-02906352⟩

Share

Metrics

Record views

71

Files downloads

129