Emerging Web services, such as email, photo sharing, and web site archives, must preserve large volumes of quickly accessible data indefinitely into the future. The costs of doing so often determine whether the service is economically viable. We make the case that these applications ’ demands on large scale storage systems over long time horizons require us to reevaluate traditional system designs. We examine threats to long-lived data from an end-to-end perspective, taking into account not just hardware and software faults but also faults due to humans and organizations. We present a simple model of long-term storage failures that helps us reason about various strategies for addressing some of these threats. Using this model we show that t...
With an ever-increasing volume of digital records and compliance requirements mandated by regulation...
As storage costs drop, storage is becoming the lowest cost in a digital repository – and the biggest...
An unprecedented amount of information encompassing almost every facet of human activities across th...
Preserving data for a long period of time in the face of faults, large and small, is crucial for des...
Digital material is vulnerable to loss and corruption as it is stored in magnetic and optical media ...
Failure is inevitable: disks fail, hosts crash, networks partition, applications stop. Consequently...
Most information today forgoes a solely physical medium and resides in a digital format; however, th...
Many scientific and socioeconomic reasons exist for the long term retention of scientific and lately...
Archival storage systems are designed for a write-once, read-maybe usage model which places an empha...
The drive for online access to archive content within ‘tapeless’ workflows means that mass-storage t...
As storage costs drop, storage is becoming the lowest cost in a digital repository – and the biggest...
This paper considers replication strategies for storage systems that aggregate the disks of many nod...
Digital storage is a key element not only of computing systems, but is now considered an essential c...
From genomic sequencing to weather forecasting, high-performance computing systems (HPCs) have prof...
The past two decades have seen an explosion in both the growth and roles of long-term digital archiv...
With an ever-increasing volume of digital records and compliance requirements mandated by regulation...
As storage costs drop, storage is becoming the lowest cost in a digital repository – and the biggest...
An unprecedented amount of information encompassing almost every facet of human activities across th...
Preserving data for a long period of time in the face of faults, large and small, is crucial for des...
Digital material is vulnerable to loss and corruption as it is stored in magnetic and optical media ...
Failure is inevitable: disks fail, hosts crash, networks partition, applications stop. Consequently...
Most information today forgoes a solely physical medium and resides in a digital format; however, th...
Many scientific and socioeconomic reasons exist for the long term retention of scientific and lately...
Archival storage systems are designed for a write-once, read-maybe usage model which places an empha...
The drive for online access to archive content within ‘tapeless’ workflows means that mass-storage t...
As storage costs drop, storage is becoming the lowest cost in a digital repository – and the biggest...
This paper considers replication strategies for storage systems that aggregate the disks of many nod...
Digital storage is a key element not only of computing systems, but is now considered an essential c...
From genomic sequencing to weather forecasting, high-performance computing systems (HPCs) have prof...
The past two decades have seen an explosion in both the growth and roles of long-term digital archiv...
With an ever-increasing volume of digital records and compliance requirements mandated by regulation...
As storage costs drop, storage is becoming the lowest cost in a digital repository – and the biggest...
An unprecedented amount of information encompassing almost every facet of human activities across th...