Scientific Workflow Rewriting while Preserving Provenance

Sarah Cohen-Boulakia 1, 2 Christine Froidevaux 1, 2 Jiuqiang Chen 1, 2, 3
2 AMIB - Algorithms and Models for Integrative Biology
LIX - Laboratoire d'informatique de l'École polytechnique [Palaiseau], LRI - Laboratoire de Recherche en Informatique, UP11 - Université Paris-Sud - Paris 11, Inria Saclay - Ile de France
Abstract : Scientific workflow systems are numerous and equipped of provenance modules able to collect data produced and consumed during workflow runs to enhance reproducibility. An increasing number of approaches have been developed to help managing provenance information. Some of them are able to process data in a polynomial time but they require workflows to have series-parallel (SP) structures. Rewriting any workflow into an SP workflow is thus particularly important. In this paper, (i) we introduce the concept of provenance-equivalent rewriting process, (ii) we review existing graph transformations, (iii) we design the provenance-equivalent SPFlow algorithm, (iv) we evaluate our approach over a thousand of real workflows.
Type de document :
Communication dans un congrès
Proc. of the 8th IEEE International Conference in eScience, Oct 2012, Chicago, United States. IEEE, 2012
Liste complète des métadonnées

https://hal.inria.fr/hal-00748032
Contributeur : Sarah Cohen-Boulakia <>
Soumis le : samedi 3 novembre 2012 - 17:50:47
Dernière modification le : mercredi 14 novembre 2018 - 16:08:06

Identifiants

  • HAL Id : hal-00748032, version 1

Collections

Citation

Sarah Cohen-Boulakia, Christine Froidevaux, Jiuqiang Chen. Scientific Workflow Rewriting while Preserving Provenance. Proc. of the 8th IEEE International Conference in eScience, Oct 2012, Chicago, United States. IEEE, 2012. 〈hal-00748032〉

Partager

Métriques

Consultations de la notice

306