A Low-overhead Cooperative Failure Detector
Failure detectors are one of the fundamental components for ensuring the high availability of large scale distributed system.The increasing popularity and demand for the large scale distributed system came with an increase in the overhead and complexity of failure detection that posed a challenge obstructing further development.In order to solve the challenge,this paper proposes a new failure detector—S-AFD which combines adaptive failure detection based on QoS (quality of service) and cooperative mechanism that share negative messages among different active nodes.It does not only reduce the detection overhead,but also adapt the various network conditions.Through analysis of experiments,it is shown that the performance of S-AFD has a clearly improvement compared with the traditional implementations of failure detectors.
large scale distributed system adaptive failure detection cooperative mechanism quality of service
Jiaxi Liu Jian Dong Zhibo Wu Jin Wu Jinghui Lan Jiaxin Yu
School of Computer Science and Technology Harbin Institute of Technology Harbin, China Beijing Research Institute of Near Space Vehicle System Engineering Beijing, China School of Information Science and Engineering Yanshan University Qinhuangdao, China
国际会议
秦皇岛
英文
811-815
2015-09-18(万方平台首次上网日期,不代表论文的发表时间)