Skip to Main content Skip to Navigation
Conference papers

Message relaying techniques for computational grids and their relations to fault tolerant message passing for the Grid

Michaël Cadilhac 1 Thomas Herault 1 Pierre Lemarinier 1
1 GRAND-LARGE - Global parallel and distributed computing
CNRS - Centre National de la Recherche Scientifique : UMR8623, Inria Saclay - Ile de France, UP11 - Université Paris-Sud - Paris 11, LIFL - Laboratoire d'Informatique Fondamentale de Lille, LRI - Laboratoire de Recherche en Informatique
Abstract : In order to execute without modi cation Message Passing distributed applications on a computational grid, one has to address many issues. The rst to come is how let processes of two di erent clusters communicate. In this work, we study the performances of relaying techniques (passing messages to a middle-tier) to solve this issue. When using relays, messages and most of the nondeterministic behavior of nodes pass through the relays during the execution. This provides the ability to implement fault tolerance at the relay level using pessimistic message logging techniques.We also evaluate the overhead of this logging and study how relays should be designed and fault tolerance protocols composed to provide a full fault-tolerant Message Passing Interface library for computational grids.
Document type :
Conference papers
Complete list of metadata

https://hal.inria.fr/hal-00700236
Contributor : Ist Rennes Connect in order to contact the contributor
Submitted on : Tuesday, May 22, 2012 - 3:21:09 PM
Last modification on : Thursday, July 8, 2021 - 3:49:26 AM

Identifiers

  • HAL Id : hal-00700236, version 1

Collections

Citation

Michaël Cadilhac, Thomas Herault, Pierre Lemarinier. Message relaying techniques for computational grids and their relations to fault tolerant message passing for the Grid. 2nd CoreGRID Workshop on GRID and Peer to Peer Systems Architecture, Jan 2006, Paris, France. ⟨hal-00700236⟩

Share

Metrics

Record views

360