Parallelization of Scientific Workflows in the Cloud

Abstract : Nowadays, more and more scientific experiments need to handle massive amounts of data. Their data processing consists of multiple computational steps and dependencies within them. A data-intensive scientific workflow is an appropriate tool for modeling such process. Since the execution of data-intensive scientific workflows requires large-scale computing and storage resources, a cloud environment, which provides virtually infinite resources is appealing. However, because of the general geographical distribution of scientific groups collaborating in the experiments, multisite management of data-intensive scientific workflows in the cloud is becoming an important problem. This paper presents a general study of the current state of the art of data-intensive scientific workflow execution in the cloud and corresponding multisite management techniques.
Liste complète des métadonnées

Cited literature [119 references]  Display  Hide  Download

https://hal.inria.fr/hal-01024101
Contributor : Ji Liu <>
Submitted on : Thursday, July 17, 2014 - 5:08:41 PM
Last modification on : Friday, March 15, 2019 - 1:15:01 AM
Document(s) archivé(s) le : Monday, November 24, 2014 - 7:05:24 PM

File

RR-8565.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01024101, version 2

Collections

Citation

Ji Liu, Esther Pacitti, Patrick Valduriez, Marta Mattoso. Parallelization of Scientific Workflows in the Cloud. [Research Report] RR-8565, INRIA. 2014. ⟨hal-01024101v2⟩

Share

Metrics

Record views

1269

Files downloads

815