Large scientific collaborations often have multiple scientists accessing the same set of files while doing different analyses, which create repeated accesses to the large amounts of shared data located far away. These data accesses have long latency due to distance and occupy the limited bandwidth available over the wide-area network. To reduce the wide-area network traffic and the data access latency, regional data storage caches have been installed as a new networking service. To study the effectiveness of such a cache system in scientific applications, we examine the Southern California Petabyte Scale Cache for a high-energy physics experiment. By examining about 3TB of operational logs, we show that this cache removed 67.6% of file requ...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
Cache injection is a viable technique to improve the performance of data-intensive parallel applicat...
Memcache is a distributed in-memory data store designed to reduce database load for web applications...
Scientific collaborations are increasingly relying on large volumes of data for their work and many ...
The volume of data moving through a network increases with new scientific experiments and simulation...
The XRootD system is used to transfer, store, and cache large datasets from high-energy physics (HEP...
The potential for improving the performance of data-intensive scientific programs by enhancing data ...
International audienceCaching can effectively reduce the cost of serving content and improve the use...
The XRootD system is used to transfer, store, and cache large datasets from high-energy physics (HEP...
Large distributed storage systems such as High Performance Computing (HPC) systems used by national ...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...
This paper evaluates network caching as a means to improve the performance of cluster-based multipro...
This paper evaluates the benefit of adding a shared cache to the network interface as a means of imp...
The Hadoop Distributed File System (HDFS) is a network file system used to support multiple widely-u...
Exponential link bandwidth increase over the past decade has sparked off interest in increasingly co...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
Cache injection is a viable technique to improve the performance of data-intensive parallel applicat...
Memcache is a distributed in-memory data store designed to reduce database load for web applications...
Scientific collaborations are increasingly relying on large volumes of data for their work and many ...
The volume of data moving through a network increases with new scientific experiments and simulation...
The XRootD system is used to transfer, store, and cache large datasets from high-energy physics (HEP...
The potential for improving the performance of data-intensive scientific programs by enhancing data ...
International audienceCaching can effectively reduce the cost of serving content and improve the use...
The XRootD system is used to transfer, store, and cache large datasets from high-energy physics (HEP...
Large distributed storage systems such as High Performance Computing (HPC) systems used by national ...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...
This paper evaluates network caching as a means to improve the performance of cluster-based multipro...
This paper evaluates the benefit of adding a shared cache to the network interface as a means of imp...
The Hadoop Distributed File System (HDFS) is a network file system used to support multiple widely-u...
Exponential link bandwidth increase over the past decade has sparked off interest in increasingly co...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
Cache injection is a viable technique to improve the performance of data-intensive parallel applicat...
Memcache is a distributed in-memory data store designed to reduce database load for web applications...