Impact: an Unreliable Failure Detector Based on Processes' Relevance and the Confidence Degree in the System - Inria - Institut national de recherche en sciences et technologies du numérique
Report (Research Report) Year: 2016

Impact: an Unreliable Failure Detector Based on Processes' Relevance and the Confidence Degree in the System

Abstract

This technical report presents a new unreliable failure detector, called the Impact failure detector (FD), that, unlike most traditional FDs, outputs a trust level value expressing the degree of confidence in the system. An impact factor is assigned to each node, and the trust level is equal to the sum of the impact factors of the nodes not suspected of failure. Moreover, a threshold parameter defines a lower bound for the trust level, above which confidence in the system is ensured. In particular, we define a flexibility property that denotes the capacity of the Impact FD to tolerate a certain margin of failures or false suspicions, i.e., its capacity to consider different sets of responses that lead the system to trusted states. The Impact FD is suitable for systems that exhibit node redundancy, node heterogeneity, and clustering features, and that allow a margin of failures that does not degrade the confidence in the system. The technical report also includes a timer-based distributed algorithm that implements an Impact FD, together with its proof of correctness, for systems whose links are lossy asynchronous or in which all (or some) links are eventually timely. Performance evaluation results based on real PlanetLab traces confirm the flexible applicability of our failure detector and show that, owing to the accepted margin of failure, both failures and false suspicions are better tolerated than with traditional unreliable failure detectors. We also show the equivalence of some classes of Impact FD with respect to the Sigma and Omega classes, which are fundamental classes for circumventing the impossibility of consensus in asynchronous message-passing distributed systems.
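
The trust level computation described in the abstract can be illustrated with a minimal Python sketch. It is not the report's algorithm: the function names `trust_level` and `is_trusted`, the node names, the impact factors, and the threshold value are all hypothetical, and treating a trust level equal to the threshold as trusted is an assumption based on the phrase "lower bound".

```python
from typing import Dict, Set


def trust_level(impact: Dict[str, int], suspected: Set[str]) -> int:
    """Sum the impact factors of all nodes that are not currently suspected."""
    return sum(factor for node, factor in impact.items() if node not in suspected)


def is_trusted(impact: Dict[str, int], suspected: Set[str], threshold: int) -> bool:
    """Assumption: the system is trusted when the trust level reaches the threshold."""
    return trust_level(impact, suspected) >= threshold


# Hypothetical impact factors: p1 is the most relevant node.
impact = {"p1": 3, "p2": 2, "p3": 1, "p4": 1}
threshold = 4  # hypothetical lower bound for the trust level

# Different sets of unsuspected nodes can keep the system trusted,
# which is the idea behind the flexibility property.
for suspected in [set(), {"p2", "p3"}, {"p1"}, {"p1", "p3"}]:
    level = trust_level(impact, suspected)
    print(sorted(suspected), level, is_trusted(impact, suspected, threshold))
```

With these illustrative values, suspecting `p2` and `p3` together, or `p1` alone, still leaves a trust level of 4, so the system stays trusted; suspecting `p1` and `p3` together drops it to 3, below the threshold.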
Main file
TechReport_Impact.pdf (1.07 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-01136595, version 1 (30-03-2015)
hal-01136595, version 2 (14-04-2015)
hal-01136595, version 3 (08-09-2016)

Identifiers

  • HAL Id: hal-01136595, version 3

Cite

Anubis G. M. Rossetto, Luciana Arantes, Pierre Sens, Claudio R. Geyer. Impact: an Unreliable Failure Detector Based on Processes' Relevance and the Confidence Degree in the System. [Research Report] Université Pierre et Marie Curie; INRIA Paris-Rocquencourt - Regal; Universidade Federal do Rio Grande do Sul. 2016. ⟨hal-01136595v3⟩
237 views
199 downloads
