In shared-disk database systems, disk access has to be scheduled properly to avoid unnecessary contention between processors. The first part of this report studies how the intermediate results of join queries (buckets) are allocated on disk and derives heuristics for determining the number of processing nodes and disks to employ. Using an analytical model, we show that declustering should be applied even to single buckets to ensure optimal performance. In the second part, we consider the order in which the buckets are read and demonstrate that highly dynamic load balancing is necessary to prevent excessive disk contention, especially under skew conditions.
Shared nothing multiprocessor architecture is known to be more scalable to support very large databa...
We investigate various load balancing approaches for hash-based join techniques popular in multicomp...
Join is the most important and expensive operation in relational databases. The parallel join operat...
For complex queries in parallel database systems, substantial amounts of data must be redistributed ...
Skew effects are still a significant problem for efficient query processing in parallel database sys...
Shared Disk database systems offer a high flexibility for parallel transaction and query processing....
Shared-disk database systems offer a high degree of freedom in the allocation of workload compared t...
Dynamic load balancing is a prerequisite for effectively utilizing large parallel database systems. ...
A consensus on parallel architecture for very large database management has emerged. This architectu...
Parallel database systems have to support the effective parallelization of complex queries in multi-...