Data Management Strategies for Scientific Applications in Cloud Environments

Abstract : Clouds are increasingly being used for running dataintensive scientific applications. However, science applications need to contend with the I/O and network performance characteristics of cloud environments. Additionally, managing data effectively and efficiently over these cloud resources is challenging due to the myriad storage choices with different performance-cost trade-offs, complex application choices, complexity associated with elasticity and failure rates. In this paper, we evaluate various aspects of data management strategies in cloud environments. Our evaluation is performed in the context of two frameworks - Hadoop and FRIEDA and conducted on four cloud testbeds - FutureGrid, ExoGeni, Grid5000, Amazon. Our experiments highlight the different performance implications of storage, file system, provis
Document type :
Reports
Complete list of metadatas

https://hal.inria.fr/hal-01102191
Contributor : Christine Morin <>
Submitted on : Monday, January 12, 2015 - 11:39:31 AM
Last modification on : Thursday, August 1, 2019 - 3:18:19 PM

Identifiers

  • HAL Id : hal-01102191, version 1

Citation

Devarshi Ghoshal, Hendrix Valerie, Eugen Feller, Christine Morin, Plale Beth, et al.. Data Management Strategies for Scientific Applications in Cloud Environments. [Research Report] LBNL-6860E, Lawrence Berkeley National Laboratory, California, USA. 2014. ⟨hal-01102191⟩

Share

Metrics

Record views

469