In this work, we focus on optimizing the deduplication system by adjusting the pertinent factors in fingerprint lookup and chunking, the factors which we identify as the key ingredients of efficient deduplication. For efficient fingerprint lookup, we propose fingerprint management scheme called LRU-based Index Partitioning. For efficient chunking, we propose Incremental Modulo-K(INC-K) algorithm which is optimized Rabin's algorithm where we significantly reduce the number of arithmetic operations exploiting the algebraic nature of modulo arithmetic. LRU-based Index Partitioning uses the notion of tablet and enforces access locality of the fingerprint lookup in storing fingerprints. We maintain tablets with LRU manner to exploit temporal...
Deduplication technologies are increasingly being de-ployed to reduce cost and increase space-effici...
University of Minnesota Ph.D. dissertation. January 2012. Major: Computer science. Advisor: Prof. Da...
Abstract—Deduplication is a commonly-used technique on disk-based storage pools. However, deduplicat...
Due to the quick increase in digital data, especially in mobile usage and social media, data dedupli...
Data deduplication has become a populartechnology for reducing the amount of storagespace necessary ...
Part 9: StorageInternational audienceData deduplication is an effective method to reduce data storag...
Abstract—Data deduplication is an essential and critical com-ponent of backup systems. Essential, be...
Storage deduplication has received recent interest in the research community. In scenarios where the...
Abstract. Large backup and restore systems may have a petabyte or more data in their repository. Suc...
Data deduplication describes a class of approaches that reduce the storage capacity needed to store ...
Data deduplication is an essential and critical component of backup systems. Essential, because it r...
Data deduplication systems discover and remove redundancies between data blocks. The search for redu...
In NAND Flash-based SSDs, deduplication can provide an effective resolution of three critical issues...
International audienceThis paper is dedicated to data deduplication algorithms and models that lead ...
The benefits provided by cloud computing and the space savings offered by data deduplication make it...
Deduplication technologies are increasingly being de-ployed to reduce cost and increase space-effici...
University of Minnesota Ph.D. dissertation. January 2012. Major: Computer science. Advisor: Prof. Da...
Abstract—Deduplication is a commonly-used technique on disk-based storage pools. However, deduplicat...
Due to the quick increase in digital data, especially in mobile usage and social media, data dedupli...
Data deduplication has become a populartechnology for reducing the amount of storagespace necessary ...
Part 9: StorageInternational audienceData deduplication is an effective method to reduce data storag...
Abstract—Data deduplication is an essential and critical com-ponent of backup systems. Essential, be...
Storage deduplication has received recent interest in the research community. In scenarios where the...
Abstract. Large backup and restore systems may have a petabyte or more data in their repository. Suc...
Data deduplication describes a class of approaches that reduce the storage capacity needed to store ...
Data deduplication is an essential and critical component of backup systems. Essential, because it r...
Data deduplication systems discover and remove redundancies between data blocks. The search for redu...
In NAND Flash-based SSDs, deduplication can provide an effective resolution of three critical issues...
International audienceThis paper is dedicated to data deduplication algorithms and models that lead ...
The benefits provided by cloud computing and the space savings offered by data deduplication make it...
Deduplication technologies are increasingly being de-ployed to reduce cost and increase space-effici...
University of Minnesota Ph.D. dissertation. January 2012. Major: Computer science. Advisor: Prof. Da...
Abstract—Deduplication is a commonly-used technique on disk-based storage pools. However, deduplicat...