Abstract: Declustering is a well-known problem in databases for parallel computing environments. In this paper, we propose a new similarity-based technique for declustering data. The proposed method can adapt to the available information about query distribution (e.g., size, shape and frequency) and can work with alternative atomic data types. Furthermore, the method is flexible and can work with alternative data distributions, data sizes and partition-size constraints. It is based on max-cut partitioning of a similarity graph defined over the given set of data, under constraints on the partition sizes. It maximizes the chances that a pair of atomic data items that are frequently accessed together by queries are allocat...
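The size-constrained max-cut idea described in the abstract can be illustrated with a small sketch. This is not the authors' algorithm: it is a simple greedy heuristic, written under the assumption that similarity is supplied as a dictionary of pairwise weights (for example, co-access frequencies) and that every partition has the same capacity. The names `greedy_decluster`, `cut_weight`, `num_disks` and `capacity` are hypothetical and chosen only for illustration. Maximizing the cut means that heavily weighted pairs end up in different partitions.

```python
from collections import defaultdict


def greedy_decluster(items, similarity, num_disks, capacity):
    """Greedy heuristic for size-constrained max-cut (illustrative only).

    items      -- list of hashable item identifiers
    similarity -- dict mapping (a, b) tuples to non-negative weights
    num_disks  -- number of partitions (assumed parameter name)
    capacity   -- maximum number of items per partition (assumed uniform)
    """
    if num_disks * capacity < len(items):
        raise ValueError("total capacity is smaller than the number of items")

    # Build a symmetric adjacency map from the pairwise weights.
    neighbors = defaultdict(dict)
    for (a, b), weight in similarity.items():
        neighbors[a][b] = weight
        neighbors[b][a] = weight

    # Place the most strongly connected items first; they are the hardest to fit.
    order = sorted(items, key=lambda x: -sum(neighbors[x].values()))

    assignment = {}
    load = [0] * num_disks
    for item in order:
        best_disk, best_cost = None, None
        for disk in range(num_disks):
            if load[disk] >= capacity:
                continue  # partition-size constraint
            # Similarity that would stay *inside* this partition (uncut weight).
            cost = sum(w for nb, w in neighbors[item].items()
                       if assignment.get(nb) == disk)
            if (best_cost is None or cost < best_cost
                    or (cost == best_cost and load[disk] < load[best_disk])):
                best_disk, best_cost = disk, cost
        assignment[item] = best_disk
        load[best_disk] += 1
    return assignment


def cut_weight(assignment, similarity):
    """Total weight of pairs whose endpoints lie in different partitions."""
    return sum(w for (a, b), w in similarity.items()
               if assignment[a] != assignment[b])
```

A greedy pass like this gives no optimality guarantee; it only shows how the similarity graph, the cut objective and the size constraint fit together. A local-search or exchange step could be layered on top to improve the cut.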
For a storage system to keep pace with increasing amounts of data, a natural solution is to deploy m...
Window functions are extremely useful and have become increasingly popular, allowing ranking, cumula...
We present a data partitioning technique for shared-nothing database systems. A unique feature of ou...
Data declustering is an important issue for reducing query response times in multi-disk database sys...
In a multiple disk environment it is desirable to have techniques for efficient parallel execution o...
The problem of disk declustering is to distribute data among multiple disks to reduce query response...
Most similarity search techniques map the data objects into some high-dimensional feature space. The...
Given a collection of objects, the Similarity Self-Join problem requires to discover all those pairs...
Advances in processor and network technologies have catalyzed the growth of data intensive applicati...
Most similarity search techniques map the data objects into some high-dimensional feature space. The...
Data declustering is used to minimize query response times in data intensive applications. In this t...
Similarity search is important for many data-intensive applications to identify a set of similar obj...
A similarity query is to find from a collection of items those that are similar to a given query ite...