Versatile, Scalable, and Accurate Simulation of Distributed Applications and Platforms

Abstract : The study of parallel and distributed applications and platforms, whether in the cluster, grid, peer-to-peer, volunteer, or cloud computing domain, often mandates empirical evaluation of proposed algorithmic and system solutions via simulation. Unlike direct experimentation via an application deployment on a real-world testbed, simulation enables fully repeatable and configurable experiments for arbitrary hypothetical scenarios. Two key concerns are accuracy (so that simulation results are scientifically sound) and scalability (so that simulation experiments can be fast and memory-efficient). While the scalability of a simulator is easily measured, the accuracy of many state-of-the-art simulators is largely unknown because they have not been sufficiently validated. In this work we describe recent accuracy and scalability advances made in the context of the SimGrid simulation framework. A design goal of SimGrid is that it should be versatile, i.e., applicable across all aforementioned domains. We present quantitative results that show that SimGrid compares favorably to state-of-the-art domain-specific simulators in terms of scalability, accuracy, or the trade-off between the two. An important implication is that, contrary to popular wisdom, striving for versatility in a simulator is not an impediment but instead is conducive to improving both accuracy and scalability.
Complete list of metadatas

Cited literature [79 references]  Display  Hide  Download

https://hal.inria.fr/hal-01017319
Contributor : Arnaud Legrand <>
Submitted on : Saturday, August 23, 2014 - 4:09:57 PM
Last modification on : Thursday, August 1, 2019 - 10:42:20 AM
Long-term archiving on : Thursday, November 27, 2014 - 1:57:06 PM

File

simgrid3-journal.pdf
Files produced by the author(s)

Identifiers

Citation

Henri Casanova, Arnaud Giersch, Arnaud Legrand, Martin Quinson, Frédéric Suter. Versatile, Scalable, and Accurate Simulation of Distributed Applications and Platforms. Journal of Parallel and Distributed Computing, Elsevier, 2014, 74 (10), pp.2899-2917. ⟨10.1016/j.jpdc.2014.06.008⟩. ⟨hal-01017319v2⟩

Share

Metrics

Record views

1964

Files downloads

2410