Abstract—Digital archives are growing rapidly, necessitating stronger reliability measures than RAID to avoid data loss from device failure. Mirroring, a popular solution, is too expensive over time. We present a compromise solution that uses multi-level redundancy coding to reduce the probability of data loss from multiple simultaneous device failures. This approach handles small-scale failures of one or two devices efficiently while still allowing the system to survive rare-event, larger-scale failures of four or more devices. In our approach, each disk is split into a set of fixed-size disklets, which are used to construct reliability stripes. To protect against rare-event failures, reliability stripes are grouped into larger super-groups...
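The two-level idea in the abstract above (disklets, reliability stripes, super-groups) can be illustrated with a minimal sketch. The abstract is cut off before it says how super-group redundancy is computed, so everything below is an assumption made for illustration: plain XOR parity at both levels, a super-group parity taken position by position across the stripes, and stripes built from disjoint sets of disks. The helper names (`xor_blocks`, `split_into_disklets`, `super_parity`, `recover_one`, `recover_with_super`) are hypothetical and do not come from the paper.

```python
from functools import reduce

def xor_blocks(blocks):
    """Bytewise XOR of equal-length byte strings."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))

def split_into_disklets(disk, size):
    """Cut one disk image into fixed-size disklets."""
    return [disk[i:i + size] for i in range(0, len(disk), size)]

def super_parity(stripes):
    """Assumed layout: one parity disklet per position, taken across the stripes."""
    return [xor_blocks(column) for column in zip(*stripes)]

def recover_one(stripe, parity):
    """Rebuild a single missing disklet (None) from the stripe's own parity."""
    missing = stripe.index(None)
    survivors = [d for d in stripe if d is not None]
    repaired = list(stripe)
    repaired[missing] = xor_blocks(survivors + [parity])
    return repaired

def recover_with_super(stripes, damaged, sup):
    """Rebuild every missing disklet of one stripe from the super-group parity.

    Requires the other stripes in the super-group to be intact at those positions.
    """
    repaired = list(stripes[damaged])
    for pos, disklet in enumerate(repaired):
        if disklet is None:
            peers = [s[pos] for i, s in enumerate(stripes) if i != damaged]
            repaired[pos] = xor_blocks(peers + [sup[pos]])
    return repaired

# Six toy "disks" of 4 bytes each, split into 2-byte disklets.
disks = [bytes([16 * d + k for k in range(4)]) for d in range(6)]
disklets = [split_into_disklets(d, 2) for d in disks]

# Two stripes built from the first disklet of disjoint disk sets.
stripe_a = [disklets[d][0] for d in (0, 1, 2)]
stripe_b = [disklets[d][0] for d in (3, 4, 5)]
parity_a = xor_blocks(stripe_a)
sup = super_parity([stripe_a, stripe_b])

# Small failure: one disklet lost, repaired inside the stripe.
one_lost = [stripe_a[0], None, stripe_a[2]]
fixed_small = recover_one(one_lost, parity_a)

# Larger failure: two disklets lost in the same stripe, beyond single parity.
two_lost = [None, None, stripe_a[2]]
fixed_large = recover_with_super([two_lost, stripe_b], 0, sup)
```

In this sketch the common case (one lost disklet) touches only the small stripe, while the rarer double loss falls back to the super-group, which is the cost trade-off the abstract describes. A real system would likely use a stronger erasure code and a more careful placement of disklets on disks than this toy layout.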
Archiving and systematic backup of large digital data quickly generates demand for multi-petabyte sc...
We propose the use of parity-based redundant data layouts of increasing reliability as a means to pr...
Abstract—We present new methods to extend data reliability of disks in RAID systems for applications...
Preserving data for a long period of time in the face of faults, large and small, is crucial for des...
Large-scale storage systems need to provide the right amount of redundancy in their storage scheme t...
Batch-correlated failures result from the manifestation of a common defect in most, if not all, disk...
Fault-tolerant disk arrays rely on replication or erasure-coding to reconstruct lost data after a di...
As we look toward exascale it is clear that high-capacity HPC storage systems will incorporate the l...
Reliability and availability are increasingly important in large-scale storage systems built from th...
Abstract—Disk failure rates vary so widely among different makes and models that designing storage s...
Inter-file compression techniques store files as sets of references to data objects or chunks that c...
Abstract — RAID has long been established as an effective way to provide highly reliable as well as ...
Large archival storage systems experience long periods of idleness broken up by rare data accesses. ...
Abstract—Archival data storage systems contain data that must be preserved over long periods of time...