Data Management Strategies for Scientific Applications in Cloud Environments

Abstract : Clouds are increasingly being used for running dataintensive scientific applications. However, science applications need to contend with the I/O and network performance characteristics of cloud environments. Additionally, managing data effectively and efficiently over these cloud resources is challenging due to the myriad storage choices with different performance-cost trade-offs, complex application choices, complexity associated with elasticity and failure rates. In this paper, we evaluate various aspects of data management strategies in cloud environments. Our evaluation is performed in the context of two frameworks - Hadoop and FRIEDA and conducted on four cloud testbeds - FutureGrid, ExoGeni, Grid5000, Amazon. Our experiments highlight the different performance implications of storage, file system, provis
Type de document :
Rapport
[Research Report] LBNL-6860E, Lawrence Berkeley National Laboratory, California, USA. 2014
Liste complète des métadonnées

https://hal.inria.fr/hal-01102191
Contributeur : Christine Morin <>
Soumis le : lundi 12 janvier 2015 - 11:39:31
Dernière modification le : mardi 14 août 2018 - 17:10:02

Identifiants

  • HAL Id : hal-01102191, version 1

Citation

Devarshi Ghoshal, Hendrix Valerie, Eugen Feller, Christine Morin, Plale Beth, et al.. Data Management Strategies for Scientific Applications in Cloud Environments. [Research Report] LBNL-6860E, Lawrence Berkeley National Laboratory, California, USA. 2014. 〈hal-01102191〉

Partager

Métriques

Consultations de la notice

432