Smart Ring: A Model of Node Failure Detection in High Available Cloud Data Center

Abstract : Nowadays most of cloud data centers deploy high available system in order to provide continuous services, so it’s very important for a high available cluster to detect the node failure (physical machine failure) accurately and timely in a low bandwidth occupation way. However, compared to the traditional cluster environment, the scale of cloud data center increases rapidly with the use of virtualization, so traditional node failure detection models have already faced several new problems. In this paper, we present a three roles and two layers node failure detection model, named as Smart Ring, which fits cloud data center well and strikes a balance between accuracy, instantaneity and bandwidth occupation. It can simultaneously detect the status of physical machines and virtual machines and deal well with multiple nodes failure and network partition. Our experiment results show that Smart Ring has a better performance than most existing models.
Document type :
Conference papers
Complete list of metadatas

Cited literature [12 references]  Display  Hide  Download

https://hal.inria.fr/hal-01551364
Contributor : Hal Ifip <>
Submitted on : Friday, June 30, 2017 - 10:36:12 AM
Last modification on : Friday, December 1, 2017 - 1:09:56 AM
Long-term archiving on : Monday, January 22, 2018 - 7:29:09 PM

File

978-3-642-35606-3_33_Chapter.p...
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Citation

Lei Xu, Wenzhi Chen, Zonghui Wang, Huafei Ni, Jiajie Wu. Smart Ring: A Model of Node Failure Detection in High Available Cloud Data Center. 9th International Conference on Network and Parallel Computing (NPC), Sep 2012, Gwangju, South Korea. pp.279-288, ⟨10.1007/978-3-642-35606-3_33⟩. ⟨hal-01551364⟩

Share

Metrics

Record views

127

Files downloads

126