A Method of Deduplication for Data Remote Backup

Abstract : The paper describes the Remote Data Disaster Recovery System using Hash to identify and avoid sending duplicate data blocks between the Primary Node and the Secondary Node, thereby, to reduce the data replication network bandwidth, decrease overhead and improve network efficiency. On both nodes, some extra storage spaces (the Hash Repositories) besides data disks are used to record the Hash for each data block on data disks. We extend the data replication protocol between the Primary Node and the Secondary Node. When the data, whose Hash exists in the Hash Repository, is duplication, the block address is transferred instead of the data, and that reduces network bandwidth requirement, saves synchronization time, and improves network efficiency.
Document type :
Conference papers
Complete list of metadatas

Cited literature [15 references]  Display  Hide  Download

https://hal.inria.fr/hal-01559563
Contributor : Hal Ifip <>
Submitted on : Monday, July 10, 2017 - 5:27:56 PM
Last modification on : Tuesday, July 18, 2017 - 3:30:46 PM
Long-term archiving on: Wednesday, January 24, 2018 - 6:49:49 PM

File

978-3-642-18333-1_10_Chapter.p...
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Citation

Jingyu Liu, Yu-An Tan, Yuanzhang Li, Xuelan Zhang, Zexiang Zhou. A Method of Deduplication for Data Remote Backup. 4th Conference on Computer and Computing Technologies in Agriculture (CCTA), Oct 2010, Nanchang, China. pp.68-75, ⟨10.1007/978-3-642-18333-1_10⟩. ⟨hal-01559563⟩

Share

Metrics

Record views

73

Files downloads

96