How to bring together fault tolerance and data consistency to enable grid data sharing

Gabriel Antoniu; Jean-François Deverge; Sébastien Monnet

Article Dans Une Revue Concurrency and Computation: Practice and Experience Année : 2006

How to bring together fault tolerance and data consistency to enable grid data sharing

(1) , (1) , (1)

Gabriel Antoniu

Fonction : Auteur
PersonId : 746326
IdHAL : gabriel-antoniu
ORCID : 0000-0001-6525-3736
IdRef : 095615296

Programming distributed parallel systems for large scale numerical simulation

Jean-François Deverge

Fonction : Auteur

Programming distributed parallel systems for large scale numerical simulation

Sébastien Monnet

Fonction : Auteur
PersonId : 831800

Programming distributed parallel systems for large scale numerical simulation

Résumé

This paper addresses the challenge of transparent data sharing within computing grids built as cluster federations. On such platforms, the availability of storage resources may change in a dynamic way, often due to hardware failures. We focus on the problem of handling the consistency of replicated data in the presence of failures. We propose a software architecture which decouples consistency management from fault tolerance management. We illustrate this architecture with a case study showing how to design a consistency protocol using fault-tolerant building blocks. As a proof of concept, we describe a prototype implementation of this protocol within JuxMem, a software experimental platform for grid data sharing, and we report on a preliminary experimental evaluation of the proposed approach.

Mots clés

PEER-TO-PEER JXTA JUXMEM

Domaines

Calcul parallèle, distribué et partagé [cs.DC]

Fichier principal

AntDevMon06CPE.pdf (473.95 Ko)

Sébastien Monnet : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00000987

Soumis le : mercredi 11 janvier 2006-09:38:55

Dernière modification le : vendredi 24 mars 2023-14:52:47

Archivage à long terme le : lundi 20 septembre 2010-14:00:06

Dates et versions

inria-00000987 , version 1 (10-01-2006)

inria-00000987 , version 2 (11-01-2006)

Identifiants

HAL Id : inria-00000987 , version 2

Citer

Gabriel Antoniu, Jean-François Deverge, Sébastien Monnet. How to bring together fault tolerance and data consistency to enable grid data sharing. Concurrency and Computation: Practice and Experience, 2006, Concurrency and Computation: Practice and Experience, 17, pp.1-19. ⟨inria-00000987v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

EC-PARIS UNIV-RENNES1 CNRS INRIA ENS-CACHAN INSA-RENNES IRISA GRID5000 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES INSA-GROUPE SILECS UR1-MATH-NUM ENS-PARIS-SACLAY

336 Consultations

295 Téléchargements

How to bring together fault tolerance and data consistency to enable grid data sharing

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager