Skip to Main content Skip to Navigation
Preprints, Working Papers, ...

Seamless Coarse Grained Parallelism Integration in Intensive Bioinformatics Workflows

Abstract : To be easily constructed, shared and maintained, complex in silico bioinformatics analysis are structured as workflows. Furthermore, the growth of computational power and storage demand from this domain, requires workflows to be efficiently executed. However, workflow performances usually rely on the ability of the designer to extract potential parallelism. But atomic bioinformatics tasks do not often exhibit direct parallelism which may appears later in the workflow design process. In this paper, we propose a Model-Driven Architecture approach for capturing the complete design process of bioinformatics workflows. More precisely, two workflow models are specified: the first one, called design model, graphically captures a low throughput prototype. The second one, called execution model, specifies multiple levels of coarse grained parallelism. The execution model is automatically generated from the design model using annotation derived from the EDAM ontology. These annotations describe the data types connecting differents elementary tasks. The execution model can then be interpreted by a workflow engine and executed on hardware having intensive computation facility.
Complete list of metadata

Cited literature [15 references]  Display  Hide  Download
Contributor : Francois Moreews Connect in order to contact the contributor
Submitted on : Wednesday, May 18, 2016 - 5:35:52 PM
Last modification on : Wednesday, November 3, 2021 - 4:34:46 AM


Files produced by the author(s)



François Moreews, Dominique Lavenier. Seamless Coarse Grained Parallelism Integration in Intensive Bioinformatics Workflows. 2016. ⟨hal-00908842⟩



Les métriques sont temporairement indisponibles