In a SQL query, aggregations are almost always placed at the top of the operator tree, after all selections and joins. However, they can also be performed before joins (eager aggregation), which can make the later joins much cheaper when applied properly. Although several enumeration algorithms that consider eager aggregation have been proposed, there are not enough evaluations to guide the adoption of this technique in practice. In particular, no evaluation has used real data sets and real queries with estimated cardinalities, so it is not known how eager aggregation performs in the real world. This thesis proposes and evaluates a new estimation method for group-by and join that combines a traditional estimation method with index-based join sampling. Two enumeration algorithms considering...
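To make the idea of aggregating before a join concrete, the following is a minimal sketch on toy data. It is not the thesis's method: the relations `orders` and `customers`, the column names, and the functions `lazy_aggregation` and `eager_aggregation` are all hypothetical, and the rewrite shown is only valid here because `customer_id` is assumed to be a key of `customers` and the aggregate is a sum.

```python
from collections import defaultdict

# Hypothetical toy relations (illustrative only, not from the thesis).
orders = [(1, 10.0), (1, 5.0), (2, 7.0), (2, 3.0), (3, 4.0)]  # (customer_id, amount)
customers = [(1, "EU"), (2, "EU"), (3, "US")]  # (customer_id, region); customer_id is a key


def lazy_aggregation(orders, customers):
    """Join first, then aggregate at the top of the plan."""
    region_of = dict(customers)
    # Join: every order row is matched with its customer's region.
    joined = [(region_of[cid], amount) for cid, amount in orders if cid in region_of]
    totals = defaultdict(float)
    for region, amount in joined:
        totals[region] += amount
    return dict(totals), len(joined)


def eager_aggregation(orders, customers):
    """Pre-aggregate orders on the join key, then join, then aggregate again."""
    partial = defaultdict(float)
    for cid, amount in orders:
        partial[cid] += amount  # one row per customer_id instead of one per order
    region_of = dict(customers)
    # Join: only the pre-aggregated rows take part.
    joined = [(region_of[cid], subtotal) for cid, subtotal in partial.items() if cid in region_of]
    totals = defaultdict(float)
    for region, subtotal in joined:
        totals[region] += subtotal
    return dict(totals), len(joined)


lazy_totals, lazy_join_rows = lazy_aggregation(orders, customers)
eager_totals, eager_join_rows = eager_aggregation(orders, customers)
```

Both plans return the same totals per region, but the eager plan feeds fewer rows into the join. Whether that saving outweighs the cost of the extra aggregation depends on the cardinalities involved, which is why the quality of the estimates matters.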
Ranking is an important property that needs to be fully supported by current relational query engine...
Thesis (Ph.D.)--University of Washington, 2021. As the demand for data intensive pipelines has grown a...
Approximate query processing is an adequate technique to reduce response times and system load in ca...
Groupjoins, the combined execution of a join and a subsequent group by, are common in analytical que...
It has been a recognized fact for many years that query execution can benefit from pushing grouping ...
We provide a new family of join algorithms, called ripple joins, for online processing of complex,...
We present a new family of join algorithms, called ripple joins, for online processing of multi-tabl...
Selectivity estimation refers to the ability of the SQL query optimizer to estimate the size of the ...
Some aggregate and grouping queries are conceptually simple, but difficult to express in SQL. This d...
The concept of time-constrained SQL queries was introduced to address the problem of long-running SQ...
Aggregates are rife in real life SQL queries. However, in the parallel query processing literature a...
Ranking is an important property that needs to be fully supported by current relational query engine...
Finding a good join order is crucial for query performance. In this paper, we introduce the Join Ord...
There is considerable overlap between strategies proposed for subquery evaluation, and those for gro...