Inter-file compression techniques store files as sets of references to data objects or chunks that can be shared among many files. While these techniques can achieve much better compression ratios than conventional intra-file compression methods such as Lempel-Ziv compression, they also reduce the reliability of the storage system because the loss of a few critical chunks can lead to the loss of many files. We show how to eliminate this problem by choosing for each chunk a replication level that is a function of the amount of data that would be lost if that chunk were lost. Experiments using actual archival data show that our technique can achieve significantly higher robustness than a conventional approach combining data mirroring and intr...
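The idea of tying a chunk's replication level to the amount of data its loss would destroy can be illustrated with a minimal sketch. The abstract does not state the actual function used, so everything below is an assumption made for illustration: the names `loss_per_chunk` and `replication_level`, the parameters `base_bytes`, `min_copies` and `max_copies`, and the rule of adding one copy per doubling of the potential loss.

```python
from collections import defaultdict


def loss_per_chunk(files, file_sizes):
    """Return, for each chunk, the total size of the files that reference it.

    `files` maps a file name to the list of chunk ids it is built from;
    `file_sizes` maps a file name to its size in bytes. Losing a chunk makes
    every file that references it unreadable, so that total is the data lost.
    """
    loss = defaultdict(int)
    for name, chunks in files.items():
        for chunk_id in set(chunks):  # count each file once per chunk
            loss[chunk_id] += file_sizes[name]
    return dict(loss)


def replication_level(lost_bytes, base_bytes, min_copies=1, max_copies=4):
    """Hypothetical rule (not from the source): add one copy per doubling
    of the potential loss above `base_bytes`, capped at `max_copies`."""
    if lost_bytes <= base_bytes:
        return min_copies
    # floor(log2(lost_bytes / base_bytes)) using integer arithmetic only
    extra = (lost_bytes // base_bytes).bit_length() - 1
    return min(max_copies, min_copies + extra)


if __name__ == "__main__":
    # Toy archive: chunk "c1" is shared by all three files.
    files = {"a": ["c1", "c2"], "b": ["c1", "c3"], "c": ["c1"]}
    file_sizes = {"a": 100, "b": 300, "c": 400}
    for chunk_id, lost in sorted(loss_per_chunk(files, file_sizes).items()):
        print(chunk_id, lost, replication_level(lost, base_bytes=100))
```

In this toy archive the shared chunk `c1` gets the most copies, because losing it would make all three files unreadable, while chunks referenced by a single small file stay at the minimum. A real system would replace the doubling rule with whatever function of the potential loss meets its reliability target.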
Many computer systems, especially in corporations, contain large amounts of documents such as letters...
Fault-tolerant disk arrays rely on replication or erasure-coding to reconstruct lost data after a di...
Digital archives are growing rapidly, necessitating stronger reliability measures than RAID...
We present the design of the Deep Store archival storage architecture, a large-scale storage system...
The ever-increasing volume of archival data that needs to be reliably retained for long periods of t...
Peer-to-peer distributed storage systems provide reliable access to data through redunda...
Given the vast volume of data that needs to be stored reliably, many data-centers and large-scale f...
Ongoing advancements in technology lead to ever-increasing storage capacities. In spite of this, opti...
Data is often replicated in distributed systems to improve availability and performance. This replic...
Redundancy in information theory is defined as the number of bits used to transmit a message minus t...
There is a huge amount of duplicated or redundant data in current storage systems. So Data De-duplic...
Şefik Şuayb Arslan (MEF Author): An erasure-coded archival file storage system is presen...