We propose a new similarity-based technique for declustering data. The proposed method can adapt to available information about query distributions, data distributions, data sizes and partition-size constraints. The method is based on max-cut partitioning of a similarity graph defined over the given set of data, under constraints on the partition sizes. It maximizes the chances that a pair of data-items that are to be accessed together by queries are allocated to distinct disks. We show that the proposed method can achieve optimal speed-up for a query-set, if there exists any other declustering method which will achieve the optimal speed-up. Experiments in parallelizing Grid Files show that the proposed method outperforms mappingfunction -b...
AbstractThe declustering problem is to allocate given data on parallel working storage devices in su...
The declustering problem is to allocate given data on parallel working storage devices in such a man...
The declustering problem is to allocate given data on parallel working storage devices in such a man...
Abstract- Declustering problems are well-known in the databases for parallel computing envi-ronments...
Data declustering is an important issue for reducing query response times in multi-disk database sys...
Cataloged from PDF version of article.Data declustering is an important issue for reducing query res...
Efficient storage and retrieval of large multidimensional datasets is an important concern for large...
The problem of disk declustering is to distribute data among multiple disks to reduce query response...
Advances in processor and network technologies have catalyzed the growth of data intensive applicati...
Declustering is a well known strategy to achieve maximum I/O parallelism in multi-disk systems. Many...
Most similarity search techniques map the data objects into some high-dimensional feature space. The...
Data declustering is used to minimize query response times in data intensive applications. In this t...
© Springer-Verlag Berlin Heidelberg 2002.In this study, parallel implementation of M-tree to index h...
Declustering is a well known strategy to achieve maximum I/O parallelism in multi-disk systems. Many...
Most similarity search techniques map the data objects into some high-dimensional feature space. The...
AbstractThe declustering problem is to allocate given data on parallel working storage devices in su...
The declustering problem is to allocate given data on parallel working storage devices in such a man...
The declustering problem is to allocate given data on parallel working storage devices in such a man...
Abstract- Declustering problems are well-known in the databases for parallel computing envi-ronments...
Data declustering is an important issue for reducing query response times in multi-disk database sys...
Cataloged from PDF version of article.Data declustering is an important issue for reducing query res...
Efficient storage and retrieval of large multidimensional datasets is an important concern for large...
The problem of disk declustering is to distribute data among multiple disks to reduce query response...
Advances in processor and network technologies have catalyzed the growth of data intensive applicati...
Declustering is a well known strategy to achieve maximum I/O parallelism in multi-disk systems. Many...
Most similarity search techniques map the data objects into some high-dimensional feature space. The...
Data declustering is used to minimize query response times in data intensive applications. In this t...
© Springer-Verlag Berlin Heidelberg 2002.In this study, parallel implementation of M-tree to index h...
Declustering is a well known strategy to achieve maximum I/O parallelism in multi-disk systems. Many...
Most similarity search techniques map the data objects into some high-dimensional feature space. The...
AbstractThe declustering problem is to allocate given data on parallel working storage devices in su...
The declustering problem is to allocate given data on parallel working storage devices in such a man...
The declustering problem is to allocate given data on parallel working storage devices in such a man...