Impact FD: An Unreliable Failure Detector Based on Process Relevance and Confidence in the System

Abstract : This paper presents a new unreliable failure detector, called the Impact failure detector (FD), that, contrary to the majority of traditional FDs, outputs a trust level value which expresses the degree of confidence in the system. An impact factor is assigned to each process, and the trust level is equal to the sum of the impact factors of the processes not suspected of failure. Moreover, a threshold parameter defines a lower bound value for the trust level, above which confidence in the system is ensured. In particular, we define a flexibility property that denotes the capacity of the Impact FD to tolerate a certain margin of failures or false suspicions, i.e., its capacity to consider different sets of responses that lead the system to trusted states. The Impact FD is suitable for systems that present node redundancy, heterogeneity of nodes, and a clustering feature, and that allow a margin of failures which does not degrade confidence in the system. The paper also includes a timer-based distributed algorithm which implements an Impact FD, as well as its proof of correctness, for systems whose links are lossy asynchronous or for those in which all (or some) links are eventually timely. Performance evaluation results, based on PlanetLab [1] traces, confirm the degree of flexible applicability of our failure detector and that, due to the accepted margin of failure, both failures and false suspicions are better tolerated than with traditional unreliable failure detectors.
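
The trust level computation described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the function names, the process identifiers, the impact factors, and the threshold value are all hypothetical, and treating the threshold as inclusive (trust level at or above the threshold means trusted) is an assumption, since the abstract only calls it a lower bound.

```python
from typing import Dict, Set


def trust_level(impact: Dict[str, int], suspected: Set[str]) -> int:
    """Sum the impact factors of the processes not currently suspected."""
    return sum(factor for proc, factor in impact.items() if proc not in suspected)


def is_trusted(impact: Dict[str, int], suspected: Set[str], threshold: int) -> bool:
    """Assumption: the system is trusted when the trust level reaches the threshold."""
    return trust_level(impact, suspected) >= threshold


# Hypothetical configuration: four processes with illustrative impact factors.
IMPACT = {"p1": 3, "p2": 2, "p3": 1, "p4": 1}
THRESHOLD = 4  # hypothetical lower bound on the trust level

# Different sets of suspected processes and the resulting verdicts.
only_p3 = is_trusted(IMPACT, {"p3"}, THRESHOLD)          # trust level 6
p2_and_p3 = is_trusted(IMPACT, {"p2", "p3"}, THRESHOLD)  # trust level 4
p1_and_p4 = is_trusted(IMPACT, {"p1", "p4"}, THRESHOLD)  # trust level 3
```

In this toy configuration, suspecting `p3` alone or `p2` and `p3` together still leaves the system trusted, whereas suspecting `p1` and `p4` does not. This is one way to read the flexibility property: several distinct sets of responding processes can lead to a trusted state.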

https://hal.inria.fr/hal-01793311
Contributor : Pierre Sens
Submitted on : Wednesday, May 16, 2018 - 2:22:38 PM
Last modification on : Friday, July 5, 2019 - 3:26:03 PM
Long-term archiving on: Tuesday, September 25, 2018 - 4:44:12 PM

File

Impact.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01793311, version 1

Citation

Anubis Graciela de Moraes Rossetto, Claudio Geyer, Luciana Arantes, Pierre Sens. Impact FD: An Unreliable Failure Detector Based on Process Relevance and Confidence in the System. The Computer Journal, Oxford University Press (UK), In press. ⟨hal-01793311⟩
