A Method of Deduplication for Data Remote Backup

Abstract:

This paper describes a remote data disaster recovery system that uses hashes to identify duplicate data blocks and avoid sending them between the Primary Node and the Secondary Node, thereby reducing the network bandwidth needed for data replication, decreasing overhead, and improving network efficiency. On both nodes, extra storage space (the Hash Repository) is set aside alongside the data disks to record the hash of each data block on those disks. We extend the data replication protocol between the Primary Node and the Secondary Node: when a block to be replicated is a duplicate, that is, its hash already exists in the Hash Repository, the block address is transferred instead of the data. This reduces the network bandwidth requirement, shortens synchronization time, and improves network efficiency.
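
The replication idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes SHA-256 as the hash (the abstract only says "Hash"), in-memory dictionaries in place of the data disks and Hash Repositories, and a hypothetical two-message protocol ("DATA" carries the block, "REF" carries only the address of an existing block with the same content). The names Node, replicate_write and apply are made up for illustration.

```python
import hashlib

BLOCK_SIZE = 4096  # assumed block size; the abstract does not specify one


def block_hash(data: bytes) -> bytes:
    # SHA-256 is an assumption; the abstract does not name a hash function.
    return hashlib.sha256(data).digest()


class Node:
    """A node with a data disk and a Hash Repository (both in memory here)."""

    def __init__(self):
        self.disk = {}       # block address -> block content
        self.hash_repo = {}  # block hash -> address of a block holding that content

    def write(self, addr: int, data: bytes) -> None:
        # Drop the repository entry that pointed at the content being overwritten,
        # so every entry always refers to a block that still holds that content.
        old = self.disk.get(addr)
        if old is not None and self.hash_repo.get(block_hash(old)) == addr:
            del self.hash_repo[block_hash(old)]
        self.disk[addr] = data
        self.hash_repo.setdefault(block_hash(data), addr)

    def apply(self, message) -> None:
        # Secondary side: rebuild the block from the payload or from a local copy.
        kind, addr, body = message
        data = self.disk[body] if kind == "REF" else body
        self.write(addr, data)


def replicate_write(primary: Node, secondary: Node, addr: int, data: bytes) -> str:
    """Write a block on the primary and replicate it; return the message kind sent."""
    src = primary.hash_repo.get(block_hash(data))
    if src is not None:
        message = ("REF", addr, src)    # duplicate: send only a block address
    else:
        message = ("DATA", addr, data)  # new content: send the full block
    primary.write(addr, data)
    secondary.apply(message)
    return message[0]
```

In this sketch both nodes apply the same writes in the same order, so a "REF" message always points at a block the Secondary Node already holds. A real system would also have to handle hash collisions, persistence of the repository, and failures between the two writes, none of which is modelled here.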

Keywords: Disaster Recovery; Deduplication; Hash; Duplicate Data

Authors: Jingyu Liu, Yu-an Tan, Yuanzhang Li, Xuelan Zhang, Zexiang Zhou

Affiliations: School of Computer Science and Technology, Beijing Institute of Technology, Beijing, 100081, P.R. China; School of Computer Science and Technology, Beijing Institute of Technology, Beijing, 100081, P.R. China; Toyou Feiji Electronics Co., Ltd., Beijing, 100081, P.R. China

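Conference type: International conference

Conference name: The 4th IFIP International Conference on Computer and Computing Technologies in Agriculture and the 4th Symposium on Development of Rural Information (CCTA 2010)

Conference location: Nanchang

Conference language: English

Pages: 68-75

Online publication date: 2010-10-22 (date first made available online on the Wanfang platform; not the paper's publication date)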
