For complex queries in parallel database systems, substantial amounts of data must be redistributed between operators executed on different processing nodes. Frequently, such intermediate results cannot be held in main memory and must be stored on disk. To limit the ensuing performance penalty, a data allocation must be found that supports parallel I/O to the greatest possible extent. In this paper, we propose declustering even self-contained units of temporary data processed in a single operation (such as individual buckets of parallel hash joins) across multiple disks. Using a suitable analytical model, we find that the improvement of parallel I/O outweighs the penalty of increased fragmentation.
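The trade-off named in the abstract (faster parallel I/O versus more fragmentation) can be illustrated with a toy calculation. The sketch below is not the paper's analytical model. It assumes round-robin placement of a bucket's pages, one positioning delay per fragment, and a fixed per-page transfer time. The names `DiskParams`, `decluster_round_robin`, `bucket_read_time_ms`, and `total_disk_busy_ms`, as well as the default timing values, are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DiskParams:
    # Assumed values for illustration only; not taken from the paper.
    positioning_ms: float = 10.0        # seek + rotational delay per fragment
    transfer_ms_per_page: float = 1.0   # transfer time for one page


def decluster_round_robin(num_pages: int, num_disks: int) -> list[list[int]]:
    """Assign the page numbers of one bucket to disks in round-robin order."""
    if num_disks < 1:
        raise ValueError("num_disks must be at least 1")
    fragments: list[list[int]] = [[] for _ in range(num_disks)]
    for page in range(num_pages):
        fragments[page % num_disks].append(page)
    return fragments


def _fragment_cost_ms(fragment: list[int], params: DiskParams) -> float:
    """One positioning delay plus the transfer time of the fragment's pages."""
    return params.positioning_ms + len(fragment) * params.transfer_ms_per_page


def bucket_read_time_ms(num_pages: int, num_disks: int,
                        params: DiskParams = DiskParams()) -> float:
    """Elapsed time when all fragments are read in parallel (slowest disk wins)."""
    fragments = decluster_round_robin(num_pages, num_disks)
    return max((_fragment_cost_ms(f, params) for f in fragments if f),
               default=0.0)


def total_disk_busy_ms(num_pages: int, num_disks: int,
                       params: DiskParams = DiskParams()) -> float:
    """Summed disk busy time: grows with the number of fragments."""
    fragments = decluster_round_robin(num_pages, num_disks)
    return sum(_fragment_cost_ms(f, params) for f in fragments if f)


if __name__ == "__main__":
    for disks in (1, 2, 4, 8):
        print(disks,
              bucket_read_time_ms(100, disks),
              total_disk_busy_ms(100, disks))
```

Under these assumed parameters, a 100-page bucket on one disk takes 110 ms to read, while spreading it over four disks reduces the elapsed time to 35 ms and raises the total disk busy time from 110 ms to 140 ms. The extra 30 ms is the fragmentation penalty in this simplified setting; whether the trade pays off in a real system depends on disk contention and workload, which this sketch ignores.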
Physical database design is important for query performance in a shared-nothing parallel database sy...
Parallel database systems have to support the effective parallelization of complex queries in multi-...
Cataloged from PDF version of article.Data declustering is an important issue for reducing query res...
In shared-disk database systems, disk access has to be scheduled properly to avoid unnecessary conte...
Shared Disk database systems offer a high flexibility for parallel transaction and query processing....
Declustering is a well known strategy to achieve maximum I/O parallelism in multi-disk systems. Many...
We present a formal analysis of the database layout problem, i.e., the problem of determining how da...
We present a data partitioning technique for shared-nothing database systems. A unique feature of ou...
Dynamic load balancing is a prerequisite for effectively utilizing large parallel database systems. ...
The problem of disk declustering is to distribute data among multiple disks to reduce query response...
Shared-disk database systems offer a high degree of freedom in the allocation of workload compared t...
Several algorithms for parallel disk systems have appeared in the literature recently, and they are ...