OBIWEE : an open source bioinformatics cloud environment

François Moreews; Jonathan Piat; Olivier Sallou

Conference paper. Year: 2011


François Moreews

Role: Author
PersonId: 748874
IdHAL: francois-moreews
ORCID: 0000-0002-4168-4459
IdRef: 192464035

Biological systems and models, bioinformatics and sequences

Système d'Information des GENomes des Animaux d'Elevage

Jonathan Piat

Role: Author

Biological systems and models, bioinformatics and sequences

Olivier Sallou

Role: Author
PersonId: 740653
IdHAL: osallou
IdRef: 253130611

Biological systems and models, bioinformatics and sequences

Abstract

Bioinformatics applications are often structured as workflows composed of a set of operations performed on large data sets. These workflows are typically deployed as complex scripts that sequence program calls with their relevant inputs and try to take advantage of a computer cluster through a scheduler. Their performance depends on the user's ability to identify the potential parallelism in the workflow. SLICEE (Service Layer for Intensive Computation Execution Environment) abstracts the cluster scheduler calls by handling command submission, parallelism extraction, and data management. A workflow client orchestrates the SLICEE services that exploit data parallelism and takes care of routing data between tasks. Workflow tasks thus take advantage of the parallelism available on the cluster with minimal user intervention. Maintaining a cluster architecture is expensive, however, and its processing power is hard to scale over time. Cloud computing virtualizes a computer architecture and deploys it on available physical computing resources, so the physical architecture is shared and the virtual processing power can be scaled to meet user demand. OBIWEE (On Demand Bioinformatics Intensive Workflow Execution Environment) is an execution environment for intensive bioinformatics workflows, preconfigured on a virtual Linux cluster that can be deployed either on a private cloud or on a public cloud service such as Amazon EC2. The virtual cluster architecture is scaled to meet the workflow requirements, and a master node runs the SLICEE middleware. Each node of the cluster runs a bioinformatics-specific Linux distribution that provides access to a wide range of bioinformatics applications. The virtual cluster has been tested on a private cloud using OpenNebula and KVM; the next step is Amazon EC2 integration. All steps, from cluster configuration to workflow design and execution, are performed through a web browser.
The open source OBIWEE bioinformatics cloud service is designed to allow groups with little IT support or limited computing infrastructure to analyze their own data. It also helps meet the growing demand for compute-intensive bioinformatics processing as sequencing technologies become widely adopted.
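
The data parallelism described in the abstract can be illustrated with a minimal sketch. This is not SLICEE code, and none of the names below come from the project: `split_into_chunks`, `run_data_parallel`, and the `gc_content` task are hypothetical, and a local thread pool stands in for the cluster scheduler. The sketch only shows the general pattern of splitting an input data set, running the same task on each part, and merging the partial results in order.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Sequence


def split_into_chunks(records: Sequence[str], n_chunks: int) -> List[List[str]]:
    """Split records into at most n_chunks contiguous, near-equal chunks."""
    if n_chunks < 1:
        raise ValueError("n_chunks must be >= 1")
    # Never create more chunks than records; keep one (empty) chunk for empty input.
    n_chunks = min(n_chunks, len(records)) or 1
    size, extra = divmod(len(records), n_chunks)
    chunks, start = [], 0
    for i in range(n_chunks):
        # The first `extra` chunks each take one additional record.
        end = start + size + (1 if i < extra else 0)
        chunks.append(list(records[start:end]))
        start = end
    return chunks


def gc_content(chunk: List[str]) -> List[float]:
    """Hypothetical stand-in for a bioinformatics tool: GC fraction per sequence."""
    return [(s.count("G") + s.count("C")) / len(s) if s else 0.0 for s in chunk]


def run_data_parallel(
    task: Callable[[List[str]], List[float]],
    records: Sequence[str],
    workers: int = 4,
) -> List[float]:
    """Apply `task` to each chunk concurrently and merge results in input order."""
    chunks = split_into_chunks(records, workers)
    # Assumption: a local thread pool replaces submission to a cluster scheduler.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partial_results = list(pool.map(task, chunks))  # map preserves chunk order
    return [value for part in partial_results for value in part]


if __name__ == "__main__":
    sequences = ["GGCC", "ATAT", "GATC", "G", "AT"]
    print(run_data_parallel(gc_content, sequences, workers=2))
```

In a real deployment the per-chunk task would presumably be a program call submitted to the cluster rather than an in-process function; the split and merge steps are the part this sketch is meant to convey.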

Keywords

cloud computing; SOA; workflow; bioinformatics; parallelism

Domains

Bioinformatics [q-bio.QM]; Bioinformatics, Systems Biology [q-bio.QM]

Contributor: Francois Moreews

https://inria.hal.science/inria-00638715

Submitted on: Monday, 7 November 2011, 10:59:54

Last modified on: Friday, 24 March 2023, 14:52:55

Dates and versions

inria-00638715, version 1 (07-11-2011)

Identifiers

HAL Id: inria-00638715, version 1
PRODINRA: 245989

Cite

François Moreews, Jonathan Piat, Olivier Sallou. OBIWEE : an open source bioinformatics cloud environment. BOSC 2011 - 12th Annual Bioinformatics Open Source Conference, Jul 2011, Vienna, Austria. ⟨inria-00638715⟩


Collections

EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES INRA IRISA IRISA-D7 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES INSA-GROUPE INRAE ANR UR1-MATH-NUM

