We present some theoretical and experimental results of an important caching problem that arises frequently in data intensive scientific applications. In such applications, jobs need to process several files simultaneously, i.e., a job can only be serviced if all its needed files are present in the disk cache. The set of files requested by a job is called a file-bundle. This requirement introduces the need for cache replacement algorithms based on file-bundles rather then individual files. We show that traditional caching algorithms such Least Recently Used (LRU), and GreedyDual-Size (GDS), are not optimal in this case since they are not sensitive to file-bundles and may hold in the cache non-relevant combinations of files. In this p...
Consider the following file caching problem: in response to a sequence of requests for files, where ...
Modern storage environments are composed of a variety of devices with different performance characte...
A software RAID file system is defined as a system that distributes data redundantly across an array...
We present some theoretical and experimental results of an important caching problem that arises fr...
The file-bundle caching problem arises frequently in scientific applications where jobs need to pro...
Multi-File Caching issues arise in applications where a set of jobs are processed and each job requ...
Cache management in flexible filesystems deals with the problem of determining a cached file to be r...
We consider how to improve the performance of file caching by allowing user-level control over file ...
Grids provide an infrastructure for seamless, secure access to a globally distributed set of shared ...
The main problem in data grids is how to provide good and timely access to huge data given the limit...
International audienceCaching can effectively reduce the cost of serving content and improve the use...
When picking a cache replacement policy for file systems, LRU (Least Recently Used) has always been ...
Caching techniques have been used to improve the performance gap of storage hierarchies in computing...
Consider the following file caching problem: in response to a sequence of requests for files, where ...
A software RAID file system is defined as a system that distributes data redundantly across an array...
Consider the following file caching problem: in response to a sequence of requests for files, where ...
Modern storage environments are composed of a variety of devices with different performance characte...
A software RAID file system is defined as a system that distributes data redundantly across an array...
We present some theoretical and experimental results of an important caching problem that arises fr...
The file-bundle caching problem arises frequently in scientific applications where jobs need to pro...
Multi-File Caching issues arise in applications where a set of jobs are processed and each job requ...
Cache management in flexible filesystems deals with the problem of determining a cached file to be r...
We consider how to improve the performance of file caching by allowing user-level control over file ...
Grids provide an infrastructure for seamless, secure access to a globally distributed set of shared ...
The main problem in data grids is how to provide good and timely access to huge data given the limit...
International audienceCaching can effectively reduce the cost of serving content and improve the use...
When picking a cache replacement policy for file systems, LRU (Least Recently Used) has always been ...
Caching techniques have been used to improve the performance gap of storage hierarchies in computing...
Consider the following file caching problem: in response to a sequence of requests for files, where ...
A software RAID file system is defined as a system that distributes data redundantly across an array...
Consider the following file caching problem: in response to a sequence of requests for files, where ...
Modern storage environments are composed of a variety of devices with different performance characte...
A software RAID file system is defined as a system that distributes data redundantly across an array...