Towards Complete Tracking of Provenance in Experimental Distributed Systems Research

Tomasz Buchert 1 Lucas Nussbaum 1, * Jens Gustedt 2
* Corresponding author
1 MADYNES - Management of dynamic networks and services
Inria Nancy - Grand Est, LORIA - NSS - Department of Networks, Systems and Services
2 CAMUS - Compilation pour les Architectures MUlti-coeurS
Inria Nancy - Grand Est, ICube - Laboratoire des sciences de l'ingénieur, de l'informatique et de l'imagerie
Abstract : Running experiments on modern systems like supercomput-ers, cloud infrastructures or P2P networks became very complex, both technically and methodologically. It is difficult to rerun an experiment or understand its results even with technical background on the technology and methods used. Storing the provenance of experimental data, i.e., storing information about how the results were produced, proved to be a powerful tool to address similar problems in computational natural sciences. In this paper, we (1) survey provenance collection in various domains of computer science, (2) introduce a new classification of prove-nance types, and (3) sketch a design of a provenance system inspired by this classification.
Liste complète des métadonnées

Cited literature [35 references]  Display  Hide  Download

https://hal.inria.fr/hal-01191855
Contributor : Lucas Nussbaum <>
Submitted on : Friday, September 4, 2015 - 8:14:13 AM
Last modification on : Thursday, February 7, 2019 - 5:34:49 PM
Document(s) archivé(s) le : Friday, May 5, 2017 - 12:16:35 PM

Files

provenance.pdf
Publisher files allowed on an open archive

Identifiers

  • HAL Id : hal-01191855, version 2

Citation

Tomasz Buchert, Lucas Nussbaum, Jens Gustedt. Towards Complete Tracking of Provenance in Experimental Distributed Systems Research. REPPAR - Second International Workshop on Reproducibility in Parallel Computing -- held together with Euro-Par, Aug 2015, Vienna, Austria. ⟨hal-01191855v2⟩

Share

Metrics

Record views

512

Files downloads

495