The XRootD system is used to transfer, store, and cache large datasets from high-energy physics (HEP). In this study we focus on its capability as distributed on-demand storage cache. Through exploring a large set of daily log files between 2020 and 2021, we seek to understand the data access patterns that might inform future cache design. Our study begins with a set of summary statistics regarding file read operations, file lifetimes, and file transfers. We observe that the number of read operations on each file remains nearly constant, while the average size of a read operation grows over time. Furthermore, files tend to have a consistent length of time during which they remain open and are in use. Based on this comprehensive study of the...
We present some theoretical and experimental results of an important caching problem that arises fr...
An efficient design for a distributed filesystem originates from a deep understanding of common acce...
Scientific collaborations are increasingly relying on large volumes of data for their work and many ...
The XRootD system is used to transfer, store, and cache large datasets from high-energy physics (HEP...
Large scientific collaborations often have multiple scientists accessing the same set of files while...
We assembled 18 months of transfer logs from a production High Performance Storage System (HPSS) sy...
International audienceCaching can effectively reduce the cost of serving content and improve the use...
Following the smashing success of XRootd-based USCMS data-federation, AAA project investigated exten...
User data analysis in high energy physics presents a challenge to spinning-disk based storage system...
A general problem faced by computing on the grid for opportunistic users is that while delivering op...
Large scientific projects are increasing relying on analyses of data for their new discoveries; and ...
[[abstract]]Cloud storage is a hot topic at the moment with Google's Google Storage, Microsoft's Sky...
Grids provide an infrastructure for seamless, secure access to a globally distributed set of shared ...
On-chip cache memories are instrumental in tackling several performance and energy issues facing con...
The proliferation of big-data processing platforms has already led to radically different system des...
We present some theoretical and experimental results of an important caching problem that arises fr...
An efficient design for a distributed filesystem originates from a deep understanding of common acce...
Scientific collaborations are increasingly relying on large volumes of data for their work and many ...
The XRootD system is used to transfer, store, and cache large datasets from high-energy physics (HEP...
Large scientific collaborations often have multiple scientists accessing the same set of files while...
We assembled 18 months of transfer logs from a production High Performance Storage System (HPSS) sy...
International audienceCaching can effectively reduce the cost of serving content and improve the use...
Following the smashing success of XRootd-based USCMS data-federation, AAA project investigated exten...
User data analysis in high energy physics presents a challenge to spinning-disk based storage system...
A general problem faced by computing on the grid for opportunistic users is that while delivering op...
Large scientific projects are increasing relying on analyses of data for their new discoveries; and ...
[[abstract]]Cloud storage is a hot topic at the moment with Google's Google Storage, Microsoft's Sky...
Grids provide an infrastructure for seamless, secure access to a globally distributed set of shared ...
On-chip cache memories are instrumental in tackling several performance and energy issues facing con...
The proliferation of big-data processing platforms has already led to radically different system des...
We present some theoretical and experimental results of an important caching problem that arises fr...
An efficient design for a distributed filesystem originates from a deep understanding of common acce...
Scientific collaborations are increasingly relying on large volumes of data for their work and many ...