Abstract. In data storage applications, a large collection of consecutively numbered data “buckets ” are often mapped to a relatively small collection of consecutively numbered storage “bins. ” For example, in parallel database applications, buckets Correspond to hash buckets of data and bins correspond to database nodes. In disk array applications, buckets correspond to logical tracks and bins correspond to physical disks in an array. Measures of the “goodness ” of a mapping method include: (1) The rime (number of operations) needed to compute the mapping. (2) The storage needed to store a representation of the mapping. (3) The balance of the mapping, i.e., the extent to which all bins receive the same number of buckets. (4) The cost of re...
AbstractThe joint demands of high performance and fault tolerance in a large array of disks can be s...
We present a randomized block-level storage virtualization for arbitrary heterogeneous storage syste...
Bitmap indexing is a common technique for indexing high-dimensional data in data warehouses and sci...
AbstractWe study a particular aspect of the balanced allocation paradigm (also known as the “two-cho...
International audienceThis paper proposes round-hashing, which is suitable for data storage on distr...
The ever-growing amount of data requires highly scalable storage solutions. The most flexible approa...
Databases allocate and free blocks of storage on disk. Freed blocks introduce holes where no data is...
A parallel file may be physically stored on several independent disks and logically partitioned by s...
Several well known techniques for organising data so that they may be retrieved on some key attribu...
Since the divergence between the processor speed and the memory access rate is progressively increas...
Distributed databases often suffer unequal distribution of data among storage nodes, which is known ...
The problem of disk declustering is to distribute data among multiple disks to reduce query response...
Datasets in large scale scientific data management, are often modeled as k-dimensional arrays. Eleme...
The paper is regarding the fair distribution of several files having different sizes to several stor...
This paper deals with load balancing in distributed memory parallel database computers. In such an e...
AbstractThe joint demands of high performance and fault tolerance in a large array of disks can be s...
We present a randomized block-level storage virtualization for arbitrary heterogeneous storage syste...
Bitmap indexing is a common technique for indexing high-dimensional data in data warehouses and sci...
AbstractWe study a particular aspect of the balanced allocation paradigm (also known as the “two-cho...
International audienceThis paper proposes round-hashing, which is suitable for data storage on distr...
The ever-growing amount of data requires highly scalable storage solutions. The most flexible approa...
Databases allocate and free blocks of storage on disk. Freed blocks introduce holes where no data is...
A parallel file may be physically stored on several independent disks and logically partitioned by s...
Several well known techniques for organising data so that they may be retrieved on some key attribu...
Since the divergence between the processor speed and the memory access rate is progressively increas...
Distributed databases often suffer unequal distribution of data among storage nodes, which is known ...
The problem of disk declustering is to distribute data among multiple disks to reduce query response...
Datasets in large scale scientific data management, are often modeled as k-dimensional arrays. Eleme...
The paper is regarding the fair distribution of several files having different sizes to several stor...
This paper deals with load balancing in distributed memory parallel database computers. In such an e...
AbstractThe joint demands of high performance and fault tolerance in a large array of disks can be s...
We present a randomized block-level storage virtualization for arbitrary heterogeneous storage syste...
Bitmap indexing is a common technique for indexing high-dimensional data in data warehouses and sci...