We assembled 18 months of transfer logs from a production High Performance Storage System (HPSS) system at the National Energy Research Scientific Computing Center(NERSC) and analyzed them to assess workload behavior and gain some insight into which cache configurations would provide the best service to the users. We found, as expected, that the workload is distributed over file size with a declining number of files as the files get larger, so the amount of space consumed per file size increment is roughly constant up to file sizes of 1 GB. Sixty one percent of file accesses were write accesses. There are a significant number of files written which are never read -- backup files and similar files. For all sizes of files, access frequencies ...
File system studies are critical to the accurate configuration, design, and continued evolution of s...
I/O performance is a critical aspect of data-intensive scientific computing. We seek to advance the ...
With the software applications increasing in complexity, description of hardware is becoming increas...
We assembled 18 months of transfer logs from a production High Performance Storage System (HPSS) sy...
The XRootD system is used to transfer, store, and cache large datasets from high-energy physics (HEP...
In this paper, we describe the collection and analysis of file system traces from a variety of diffe...
The XRootD system is used to transfer, store, and cache large datasets from high-energy physics (HEP...
Large scientific collaborations often have multiple scientists accessing the same set of files while...
This thesis explores ways in which intermediate cache servers affect the performance and scalability...
As per-core CPU performance plateaus and data-bound applications like graph analytics and key-value ...
This thesis describes the effect of write caching on overall file system performance. It will show t...
Current network file system protocols rely heavily on a central server to coordinate file activity a...
The proliferation of big-data processing platforms has already led to radically different system des...
An efficient design for a distributed filesystem originates from a deep understanding of common acce...
Parallel scientific applications require high-performance I/O support from underlying file systems. ...
File system studies are critical to the accurate configuration, design, and continued evolution of s...
I/O performance is a critical aspect of data-intensive scientific computing. We seek to advance the ...
With the software applications increasing in complexity, description of hardware is becoming increas...
We assembled 18 months of transfer logs from a production High Performance Storage System (HPSS) sy...
The XRootD system is used to transfer, store, and cache large datasets from high-energy physics (HEP...
In this paper, we describe the collection and analysis of file system traces from a variety of diffe...
The XRootD system is used to transfer, store, and cache large datasets from high-energy physics (HEP...
Large scientific collaborations often have multiple scientists accessing the same set of files while...
This thesis explores ways in which intermediate cache servers affect the performance and scalability...
As per-core CPU performance plateaus and data-bound applications like graph analytics and key-value ...
This thesis describes the effect of write caching on overall file system performance. It will show t...
Current network file system protocols rely heavily on a central server to coordinate file activity a...
The proliferation of big-data processing platforms has already led to radically different system des...
An efficient design for a distributed filesystem originates from a deep understanding of common acce...
Parallel scientific applications require high-performance I/O support from underlying file systems. ...
File system studies are critical to the accurate configuration, design, and continued evolution of s...
I/O performance is a critical aspect of data-intensive scientific computing. We seek to advance the ...
With the software applications increasing in complexity, description of hardware is becoming increas...