Data warehouse queries pose challenging performance problems that often necessitate the use of parallel database systems (PDBS). Although dynamic load balancing is of key importance in PDBS, to our knowledge it has not yet been investigated thoroughly for parallel data warehouses. In this study, we propose a scheduling strategy that simultaneously considers both processors and disks while utilizing the load balancing potential of a Shared Disk architecture. We compare the performance of this new method to several other approaches in a comprehensive simulation study, incorporating skew aspects and typical data warehouse features such as star schemas
Shared Disk database systems offer a high flexibility for parallel transaction and query processing....
We consider the execution of multi-join queries in a hierarchical parallel system, i.e., a shared-no...
Clusters are now composed of non-uniform nodes with different CPUs, disks or network cards so that c...
Data warehouse queries pose challenging performance problems that often necessitate the use of paral...
Dynamic load balancing is a prerequisite for effectively utilizing large parallel database systems. ...
Parallel database systems have to support the effective parallelization of complex queries in multi-...
Shared-disk database systems offer a high degree of freedom in the allocation of workload compared t...
Data allocation is a key performance factor for parallel database systems (PDBS). This holds especia...
International audienceDefinition : The goal of parallel query execution is minimizing query response...
Parallel database systems have to support the effective parallelization of complex queries in multi-...
Skew effects are a serious problem in parallel database systems, but the relationship between differ...
This paper presents a multidimensional schema, called the multidimensional range tree (MDR-tree), to...
In shared-disk database systems, disk access has to be scheduled properly to avoid unnecessary conte...
In this paper, we investigate two scheduling approaches for multicomputer-based parallel database sy...
Amount of data stored in enterprises are increasing rapidly. Volume of data stored in database is ap...
Shared Disk database systems offer a high flexibility for parallel transaction and query processing....
We consider the execution of multi-join queries in a hierarchical parallel system, i.e., a shared-no...
Clusters are now composed of non-uniform nodes with different CPUs, disks or network cards so that c...
Data warehouse queries pose challenging performance problems that often necessitate the use of paral...
Dynamic load balancing is a prerequisite for effectively utilizing large parallel database systems. ...
Parallel database systems have to support the effective parallelization of complex queries in multi-...
Shared-disk database systems offer a high degree of freedom in the allocation of workload compared t...
Data allocation is a key performance factor for parallel database systems (PDBS). This holds especia...
International audienceDefinition : The goal of parallel query execution is minimizing query response...
Parallel database systems have to support the effective parallelization of complex queries in multi-...
Skew effects are a serious problem in parallel database systems, but the relationship between differ...
This paper presents a multidimensional schema, called the multidimensional range tree (MDR-tree), to...
In shared-disk database systems, disk access has to be scheduled properly to avoid unnecessary conte...
In this paper, we investigate two scheduling approaches for multicomputer-based parallel database sy...
Amount of data stored in enterprises are increasing rapidly. Volume of data stored in database is ap...
Shared Disk database systems offer a high flexibility for parallel transaction and query processing....
We consider the execution of multi-join queries in a hierarchical parallel system, i.e., a shared-no...
Clusters are now composed of non-uniform nodes with different CPUs, disks or network cards so that c...