The XRootD system is used to transfer, store, and cache large datasets from high-energy physics (HEP). In this study we focus on its capability as distributed on-demand storage cache. Through exploring a large set of daily log files between 2020 and 2021, we seek to understand the data access patterns that might inform future cache design. Our study begins with a set of summary statistics regarding file read operations, file lifetimes, and file transfers. We observe that the number of read operations on each file remains nearly constant, while the average size of a read operation grows over time. Furthermore, files tend to have a consistent length of time during which they remain open and are in use. Based on this comprehensive study of the...
Scientific domains such as Climate Science, High Energy Particle Physics (HEP) and others, routinely...
In recent years, high-end computing has undergone two significant changes: (1) an increasing focus o...
The projected Storage and Compute needs for the HL-LHC will be a factor up to 10 above what can be a...
The XRootD system is used to transfer, store, and cache large datasets from high-energy physics (HEP...
Large scientific collaborations often have multiple scientists accessing the same set of files while...
We assembled 18 months of transfer logs from a production High Performance Storage System (HPSS) sy...
International audienceCaching can effectively reduce the cost of serving content and improve the use...
Scientific collaborations are increasingly relying on large volumes of data for their work and many ...
User data analysis in high energy physics presents a challenge to spinning-disk based storage system...
Large scientific projects are increasing relying on analyses of data for their new discoveries; and ...
An efficient design for a distributed filesystem originates from a deep understanding of common acce...
Following the smashing success of XRootd-based USCMS data-federation, AAA project investigated exten...
Grids provide an infrastructure for seamless, secure access to a globally distributed set of shared ...
The proliferation of big-data processing platforms has already led to radically different system des...
[[abstract]]Cloud storage is a hot topic at the moment with Google's Google Storage, Microsoft's Sky...
Scientific domains such as Climate Science, High Energy Particle Physics (HEP) and others, routinely...
In recent years, high-end computing has undergone two significant changes: (1) an increasing focus o...
The projected Storage and Compute needs for the HL-LHC will be a factor up to 10 above what can be a...
The XRootD system is used to transfer, store, and cache large datasets from high-energy physics (HEP...
Large scientific collaborations often have multiple scientists accessing the same set of files while...
We assembled 18 months of transfer logs from a production High Performance Storage System (HPSS) sy...
International audienceCaching can effectively reduce the cost of serving content and improve the use...
Scientific collaborations are increasingly relying on large volumes of data for their work and many ...
User data analysis in high energy physics presents a challenge to spinning-disk based storage system...
Large scientific projects are increasing relying on analyses of data for their new discoveries; and ...
An efficient design for a distributed filesystem originates from a deep understanding of common acce...
Following the smashing success of XRootd-based USCMS data-federation, AAA project investigated exten...
Grids provide an infrastructure for seamless, secure access to a globally distributed set of shared ...
The proliferation of big-data processing platforms has already led to radically different system des...
[[abstract]]Cloud storage is a hot topic at the moment with Google's Google Storage, Microsoft's Sky...
Scientific domains such as Climate Science, High Energy Particle Physics (HEP) and others, routinely...
In recent years, high-end computing has undergone two significant changes: (1) an increasing focus o...
The projected Storage and Compute needs for the HL-LHC will be a factor up to 10 above what can be a...