Assessing the Performance of MPI Applications Through Time-Independent Trace Replay

Frédéric Desprez 1 George Markomanolis 1 Martin Quinson 2 Frédéric Suter 3
1 GRAAL - Algorithms and Scheduling for Distributed Heterogeneous Platforms
Inria Grenoble - Rhône-Alpes, LIP - Laboratoire de l'Informatique du Parallélisme
2 ALGORILLE - Algorithms for the Grid
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : Simulation is a popular approach to obtain objective performance indicators platforms that are not at one's disposal. It may help the dimensioning of compute clusters in large computing centers. In this work we present a framework for the off-line simulation of MPI applications. Its main originality with regard to the literature is to rely on time-independent execution traces. This allows us to completely decouple the acquisition process from the actual replay of the traces in a simulation context. Then we are able to acquire traces for large application instances without being limited to an execution on a single compute cluster. Finally our framework is built on top of a scalable, fast, and validated simulation kernel. In this paper, we present the used time-independent trace format, investigate several acquisition strategies, detail the developed trace replay tool, and assess the quality of our simulation framework in terms of accuracy, acquisition time, simulation time, and trace size.
Complete list of metadatas

Cited literature [23 references]  Display  Hide  Download

https://hal.inria.fr/inria-00546992
Contributor : Martin Quinson <>
Submitted on : Wednesday, December 15, 2010 - 12:09:09 PM
Last modification on : Wednesday, April 17, 2019 - 4:07:57 PM
Long-term archiving on : Monday, November 5, 2012 - 1:51:02 PM

File

RR-7489.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00546992, version 1

Citation

Frédéric Desprez, George Markomanolis, Martin Quinson, Frédéric Suter. Assessing the Performance of MPI Applications Through Time-Independent Trace Replay. Second International Workshop on Parallel Software Tools and Tool Infrastructures (PSTI 2011) Held in conjunction with ICPP 2011, the 40th International Conference on Parallel Processing, Sep 2011, Taipei, Taiwan. ⟨inria-00546992⟩

Share

Metrics

Record views

880

Files downloads

463