CSHFt: A Composite Fault-tolerant Architecture and Self-adaptable Hierarchical Fault-tolerant Strategy for Satellite System
Nowadays, building parallel system with highperformance commercial off-the-shelf (COTS) chips becomes the main way to improve satellite system performance greatly. System reliability is a tough issue which needs to be solved by more effective fault-tolerant scheme. Centralized faulttolerant scheme has the risk of single point of failure (SPOF), while distributed fault-tolerant scheme is much complex and introduces large overhead. Both of these traditional methods have their own drawbacks. This paper proposes a composite self-adaptable hierarchical fault-tolerant (CSHFt) scheme which effectively integrates and expands the ideas of centralized and distributed fault-tolerant methods. It constructs a composite and symmetrical system architecture supporting both fault-tolerant methods simultaneously and realizes the self-adaptable hierarchical fault-tolerant strategy. CSHFt scheme executes system fault tolerance in the sequence of ‘first centralized then distributed’. System switches its faulttolerant mode actively according to its performing history. By combining and completing two traditional methods, CSHFt scheme eliminates the risk of SPOF, reduces the overall complexity and overhead of system fault tolerance and enhances system’s real-time feature. System failure only occurs when all the system nodes are broken, which makes system highly reliable. Based on the prototype system, we verify the practical CSHFt scheme. Its performance is also analyzed and evaluated.
satellite system fault-tolerant architecture composite architecture fault-tolerant strategy hierarchical strategy self-adaptable
Hao Zhou Jingfei Jiang
College of Computer Science National University of Defense Technology Changsha, China
国际会议
无锡
英文
333-337
2011-10-14(万方平台首次上网日期,不代表论文的发表时间)