Skip to Main content Skip to Navigation
New interface
Conference papers

(Re)Use in Public Scientific Workflow Repositories

Johannes Starlinger 1 Sarah Cohen-Boulakia 2, 3 Ulf Leser 1 
3 AMIB - Algorithms and Models for Integrative Biology
LIX - Laboratoire d'informatique de l'École polytechnique [Palaiseau], LRI - Laboratoire de Recherche en Informatique, UP11 - Université Paris-Sud - Paris 11, Inria Saclay - Ile de France
Abstract : Scientific workflows have been introduced to enhance reproducibility, sharing and reuse of in-silico experiments. Faced with increasing numbers of workflows available in public repositories, users have a crucial need for assistance in workflow discovery. Identifying the functional elements shared between workflows and thus determining similarity between workflows is then a key point. In this paper, we present the results of a study we performed on 898 workflows from myExperiment. Our contribution is four fold: (i) we discuss the critical problem of identifying workflows and workflow elements, (ii) we provide detailed analysis about the frequencies of re-used elements across workflows, (iii) we consider, for the first time, the problem of cross-author reuse and (iv) we highlight characteristics shared between reused elements.
Complete list of metadata
Contributor : Sarah Cohen-Boulakia Connect in order to contact the contributor
Submitted on : Saturday, November 3, 2012 - 5:22:44 PM
Last modification on : Tuesday, November 22, 2022 - 11:10:08 AM


  • HAL Id : hal-00748029, version 1


Johannes Starlinger, Sarah Cohen-Boulakia, Ulf Leser. (Re)Use in Public Scientific Workflow Repositories. Scientific and Statistical Database Management - 24th International Conference, SSDBM 2012, Jun 2012, Chania, Greece. pp.361-378. ⟨hal-00748029⟩



Record views