(Re)Use in Public Scientific Workflow Repositories

Johannes Starlinger 1 Sarah Cohen-Boulakia 2, 3 Ulf Leser 1
3 AMIB - Algorithms and Models for Integrative Biology
LIX - Laboratoire d'informatique de l'École polytechnique [Palaiseau], LRI - Laboratoire de Recherche en Informatique, UP11 - Université Paris-Sud - Paris 11, Inria Saclay - Ile de France
Abstract : Scientific workflows have been introduced to enhance reproducibility, sharing and reuse of in-silico experiments. Faced with increasing numbers of workflows available in public repositories, users have a crucial need for assistance in workflow discovery. Identifying the functional elements shared between workflows and thus determining similarity between workflows is then a key point. In this paper, we present the results of a study we performed on 898 workflows from myExperiment. Our contribution is four fold: (i) we discuss the critical problem of identifying workflows and workflow elements, (ii) we provide detailed analysis about the frequencies of re-used elements across workflows, (iii) we consider, for the first time, the problem of cross-author reuse and (iv) we highlight characteristics shared between reused elements.
Complete list of metadatas

https://hal.inria.fr/hal-00748029
Contributor : Sarah Cohen-Boulakia <>
Submitted on : Saturday, November 3, 2012 - 5:22:44 PM
Last modification on : Wednesday, March 27, 2019 - 4:41:29 PM

Identifiers

  • HAL Id : hal-00748029, version 1

Collections

Citation

Johannes Starlinger, Sarah Cohen-Boulakia, Ulf Leser. (Re)Use in Public Scientific Workflow Repositories. Scientific and Statistical Database Management - 24th International Conference, SSDBM 2012, Jun 2012, Chania, Greece. pp.361-378. ⟨hal-00748029⟩

Share

Metrics

Record views

388