HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation

Using failure injection mechanisms to experiment and evaluate a hierarchical failure detector

Sébastien Monnet 1 Marin Bertier 1
1 PARIS - Programming distributed parallel systems for large scale numerical simulation
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, ENS Cachan - École normale supérieure - Cachan, Inria Rennes – Bretagne Atlantique
Abstract : Computing grids consist of a large-scale, highly-distributed hardware architecture, often built in a hierarchical way, as cluster federations. At such scales, failures are no longer exceptions, but part of the normal behavior. When designing software for grids, developers have to take failures into account, in order to be able to provide a stable service. The fault-tolerance mechanisms need to be validated and evaluated. It is therefore crucial to make experiments at a large scale, with various volatility conditions, in order to measure the impact of failures on the whole system. This paper presents an experimental tool allowing the user to control the volatility conditions during a practical evaluation of fault-tolerant systems. The tool is based on failure-injection mechanisms. We illustrate the usefulness of our tool through an evaluation of a failure detector specifically designed for hierarchical grids.
Document type :
Complete list of metadata

Cited literature [18 references]  Display  Hide  Download

Contributor : Rapport de Recherche Inria Connect in order to contact the contributor
Submitted on : Friday, May 19, 2006 - 7:30:41 PM
Last modification on : Friday, February 4, 2022 - 3:23:59 AM
Long-term archiving on: : Sunday, April 4, 2010 - 8:37:10 PM


  • HAL Id : inria-00070213, version 1


Sébastien Monnet, Marin Bertier. Using failure injection mechanisms to experiment and evaluate a hierarchical failure detector. [Research Report] RR-5811, INRIA. 2006, pp.19. ⟨inria-00070213⟩



Record views


Files downloads