Skip to Main content Skip to Navigation

Data Management Strategies for Scientific Applications in Cloud Environments

Abstract : Clouds are increasingly being used for running dataintensive scientific applications. However, science applications need to contend with the I/O and network performance characteristics of cloud environments. Additionally, managing data effectively and efficiently over these cloud resources is challenging due to the myriad storage choices with different performance-cost trade-offs, complex application choices, complexity associated with elasticity and failure rates. In this paper, we evaluate various aspects of data management strategies in cloud environments. Our evaluation is performed in the context of two frameworks - Hadoop and FRIEDA and conducted on four cloud testbeds - FutureGrid, ExoGeni, Grid5000, Amazon. Our experiments highlight the different performance implications of storage, file system, provis
Document type :
Complete list of metadata
Contributor : Christine Morin Connect in order to contact the contributor
Submitted on : Monday, January 12, 2015 - 11:39:31 AM
Last modification on : Tuesday, October 19, 2021 - 11:58:54 PM


  • HAL Id : hal-01102191, version 1


Devarshi Ghoshal, Hendrix Valerie, Eugen Feller, Christine Morin, Plale Beth, et al.. Data Management Strategies for Scientific Applications in Cloud Environments. [Research Report] LBNL-6860E, Lawrence Berkeley National Laboratory, California, USA. 2014. ⟨hal-01102191⟩



Record views