Commercial enterprise data warehouses are typically implemented on parallel databases due to the inherent scalability and performance limitation of a serial architecture. Queries used in such large data warehouses can contain complex predicates as well as multiple joins, and the resulting query execution plans generated by the optimizer may be sub-optimal due to mis-estimates of row cardinalities. Progressive optimization (POP) is an approach to detect cardinality estimation errors by monitoring actual cardinalities at run-time and to recover by triggering re-optimization with the actual cardinalities measured. However, the original serial POP solution is based on a serial processing architecture, and the core ideas cannot be readily applie...
Designing a Parallel Relational Data Warehouse (PRDW) consists of a set of tasks: (i) choosing the h...
. This paper describes a method for optimizing data communication and control for parallel execution...
Parallelism is a viable solution to constructing high performance object-oriented database systems. ...
Virtually every commercial query optimizer chooses the best plan for a query using a cost model that...
Physical database design is important for query performance in a shared-nothing parallel database sy...
In the current work, we derive a complete approach to optimization and automatic parallelization of ...
Shared nothing parallel data ow systems aim to bridge the gap between MapReduce and RDBMSs by combin...
Data processing systems offer an ever increasing degree of parallelism on the levels of cores, CPUs,...
Federated queries are regular relational queries accessing data on one or more remote relational or ...
The advanced productivity of the modern society has created a wide range of similar commodities. How...
Clusters are now composed of non-uniform nodes with different CPUs, disks or network cards so that c...
For complex queries in parallel database systems, substantial amounts of data must be redistributed ...
In this paper we present a new framework for studying parallel query optimization. We first note tha...
This paper describes a post-mortem call-graph profiling tool that analyses trace information generat...
A new emerging class of parallel database management systems (DBMS) is designed to take advantage of...
Designing a Parallel Relational Data Warehouse (PRDW) consists of a set of tasks: (i) choosing the h...
. This paper describes a method for optimizing data communication and control for parallel execution...
Parallelism is a viable solution to constructing high performance object-oriented database systems. ...
Virtually every commercial query optimizer chooses the best plan for a query using a cost model that...
Physical database design is important for query performance in a shared-nothing parallel database sy...
In the current work, we derive a complete approach to optimization and automatic parallelization of ...
Shared nothing parallel data ow systems aim to bridge the gap between MapReduce and RDBMSs by combin...
Data processing systems offer an ever increasing degree of parallelism on the levels of cores, CPUs,...
Federated queries are regular relational queries accessing data on one or more remote relational or ...
The advanced productivity of the modern society has created a wide range of similar commodities. How...
Clusters are now composed of non-uniform nodes with different CPUs, disks or network cards so that c...
For complex queries in parallel database systems, substantial amounts of data must be redistributed ...
In this paper we present a new framework for studying parallel query optimization. We first note tha...
This paper describes a post-mortem call-graph profiling tool that analyses trace information generat...
A new emerging class of parallel database management systems (DBMS) is designed to take advantage of...
Designing a Parallel Relational Data Warehouse (PRDW) consists of a set of tasks: (i) choosing the h...
. This paper describes a method for optimizing data communication and control for parallel execution...
Parallelism is a viable solution to constructing high performance object-oriented database systems. ...