Groupjoins, the combined execution of a join and a subsequent group by, are common in analytical queries and occur in about 1/8 of the queries in TPC-H and TPC-DS. While they were originally invented to improve performance, their efficient parallel execution can be limited by contention, which reduces their usefulness in a many-core system. Having an efficient implementation of groupjoins is highly desirable, as groupjoins are not only used to fuse group by and join but are also introduced by the unnesting component of the query optimizer to avoid nested-loops evaluation of aggregates. Furthermore, the query optimizer needs to be able to reason over the result of aggregation in order to schedule it correctly. Traditional selectivity an...
Users and administrators of large-scale infrastructures (e.g., datacenters and PlanetLab) are freque...
Since the introduction of cost-based query optimization by Selinger et al. in their seminal paper,...
In SPARQL, conjunctive queries are expressed by using shared variables across sets of triple pattern...
It has been a recognized fact for many years that query execution can benefit from pushing grouping ...
Aggregations are almost always done at the top of the operator tree after all selections and joins in a ...
Queries containing aggregate functions often combine multiple tables through join operations. This q...
This paper presents a principled framework for efficient processing of ad-hoc top-k (ranking) aggreg...
We propose a new class of algorithms that can be used to speed up the execution of multi-way join qu...
In this paper, we develop a principled framework for efficient processing of ad-hoc top-k (ranking) ...
There is considerable overlap between strategies proposed for subquery evaluation, and those for gro...
Thesis (Ph.D.)--University of Washington, 2021. As the demand for data intensive pipelines has grown a...
Aggregates are rife in real life SQL queries. However, in the parallel query processing literature a...
Summarization: An emerging challenge in modern distributed querying is to efficiently process mult...
Some aggregate and grouping queries are conceptually simple, but difficult to express in SQL. This d...
We provide a new family of join algorithms, called ripple joins, for online processing of complex,...