Data Management Strategies for Scientific Applications in Cloud Environments - Archive ouverte HAL Access content directly
Reports (Research Report) Year : 2014

Data Management Strategies for Scientific Applications in Cloud Environments

(1) , (2) , (3) , (3) , (1) , (4)
1
2
3
4

Abstract

Clouds are increasingly being used for running dataintensive scientific applications. However, science applications need to contend with the I/O and network performance characteristics of cloud environments. Additionally, managing data effectively and efficiently over these cloud resources is challenging due to the myriad storage choices with different performance-cost trade-offs, complex application choices, complexity associated with elasticity and failure rates. In this paper, we evaluate various aspects of data management strategies in cloud environments. Our evaluation is performed in the context of two frameworks - Hadoop and FRIEDA and conducted on four cloud testbeds - FutureGrid, ExoGeni, Grid5000, Amazon. Our experiments highlight the different performance implications of storage, file system, provis
Not file

Dates and versions

hal-01102191 , version 1 (12-01-2015)

Identifiers

  • HAL Id : hal-01102191 , version 1

Cite

Devarshi Ghoshal, Hendrix Valerie, Eugen Feller, Christine Morin, Plale Beth, et al.. Data Management Strategies for Scientific Applications in Cloud Environments. [Research Report] LBNL-6860E, Lawrence Berkeley National Laboratory, California, USA. 2014. ⟨hal-01102191⟩
216 View
0 Download

Share

Gmail Facebook Twitter LinkedIn More