Using Active Data to Provide Smart Data Surveillance to E-Science Users

Abstract: Modern scientific experiments often involve multiple storage and computing platforms, software tools, and analysis scripts. The resulting heterogeneous environments make data management operations challenging; the significant number of events and the absence of data integration make it difficult to track data provenance, manage sophisticated analysis processes, and recover from unexpected situations. Current approaches often require costly human intervention and are inherently error-prone. The difficulties inherent in managing and manipulating such large and highly distributed datasets also limit automated sharing and collaboration. We study a real-world e-Science application involving terabytes of data, using three different analysis and storage platforms, and a number of applications and analysis processes. We demonstrate that using a specialized data life cycle and programming model, Active Data, we can easily implement global progress monitoring and sharing; recover from unexpected events; and automate a range of tasks.
Document type:
Conference paper
23rd Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP), Mar 2015, Turku, Finland. 2015, 〈10.1109/PDP.2015.76〉

https://hal.inria.fr/hal-01256207
Contributor: Gilles Fedak
Submitted on: Thursday, January 14, 2016 - 15:00:48
Last modified on: Friday, April 20, 2018 - 15:44:26
Document(s) archived on: Friday, November 11, 2016 - 05:58:46

File

active-data.euromicropdp.2015....
Files produced by the author(s)

License

Distributed under a Creative Commons Attribution - NonCommercial - ShareAlike 4.0 International License

Citation

Anthony Simonet, Kyle Chard, Gilles Fedak, Ian Foster. Using Active Data to Provide Smart Data Surveillance to E-Science Users. 23rd Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP), Mar 2015, Turku, Finland. 2015, 〈10.1109/PDP.2015.76〉. 〈hal-01256207〉

Metrics

Record views: 323

File downloads: 70