Efficient distributed filesystem design starts from a solid understanding of common access patterns and user behavior, which is obtained through detailed analysis of traces and snapshots. In this paper we analyze traces from eight distributed filesystems that represent a mix of workloads taken from educational, research, and commercial environments. We focus on characterizing block access patterns, the amount of block sharing, and working set size over long periods of time, and we look for behaviors common to all workloads that can be generalized to other storage systems. We found that most environments shared large numbers of blocks over time, and that block sharing was significantly affected by repetitive human behavior. We als...
Secret sharing-based distributed storage systems can provide long-term protection of confidentiality...
We compare and contrast long-term file system activity for different Unix environments for periods o...
This paper proposes a new data placement policy to allocate data blocks across...
The past two decades have seen an explosion in both the growth and roles of long-term digital archiv...
In this paper, we describe the collection and analysis of file system traces from a variety of diffe...
The goal of this project is to propose improvements in long-term disk read performance for large-sca...
File system studies are critical to the accurate configuration, design, and continued evolution of s...
Scientific collaborations are increasingly relying on large volumes of data for their work and many ...
The XRootD system is used to transfer, store, and cache large datasets from high-energy physics (HEP...
Caching has long been recognized as a powerful performance enhancement technique in many...
This thesis explores ways in which intermediate cache servers affect the performance and scalability...
We assembled 18 months of transfer logs from a production High Performance Storage System (HPSS) sys...
As mass storage technology becomes more affordable for sites smaller than supercomputer centers, un...
Block correlations are common semantic patterns in storage systems. They can be exploited for impro...
Large scientific collaborations often have multiple scientists accessing the same set of files while...