Optimization of join queries based on average selectivities is suboptimal in highly correlated databases. In such databases, relations are naturally divided into partitions, each partition having substantially different statistical characteristics. It is very compelling to discover such data partitions during query optimization and create multiple plans for a given query, one plan being optimal for a particular combination of data partitions. This scenario calls for the sharing of state among plans, so that common intermediate results are not recomputed. We study this problem in a setting with a routing-based query execution engine based on eddies [1]. Eddies naturally encapsulate horizontal partitioning and maximal state sharing across mul...
Advanced Data Mining applications require more and more support from relational database engines. Es...
Abstract. Due to high data volumes and unpredictable arrival rates, continuous query systems process...
Today’s data deluge enables organizations to collect massive data, and analyze it with an ever-incre...
It is known that optimization of join queries based on average selectivities is sub-optimal in highl...
Thesis (Ph.D.)--University of Washington, 2021As the demand for data intensive pipelines has grown a...
We present a query architecture in which join operators are decomposed into their constituent data s...
Table partitioning splits a table into smaller parts that can be accessed, stored, and maintained in...
Multiway join queries incur high-cost I/Os operations over large-scale data. Exploiting sharing join...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
In many database queries relations are access multiple times during query processing. In these cases...
Fast and accurate estimations for complex queries are profoundly beneficial for large databases with...
In many database queries relations are access multiple times during query processing. In these cases...
Join optimization is one of the most challenging tasks in query processing. The perfor-mance of join...
We apply the one shot semi-join execution strategy, which allows parallelism and processes multiple ...
Summarization: An emerging challenge in modern distributed querying is to effi- ciently process mult...
Advanced Data Mining applications require more and more support from relational database engines. Es...
Abstract. Due to high data volumes and unpredictable arrival rates, continuous query systems process...
Today’s data deluge enables organizations to collect massive data, and analyze it with an ever-incre...
It is known that optimization of join queries based on average selectivities is sub-optimal in highl...
Thesis (Ph.D.)--University of Washington, 2021As the demand for data intensive pipelines has grown a...
We present a query architecture in which join operators are decomposed into their constituent data s...
Table partitioning splits a table into smaller parts that can be accessed, stored, and maintained in...
Multiway join queries incur high-cost I/Os operations over large-scale data. Exploiting sharing join...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
In many database queries relations are access multiple times during query processing. In these cases...
Fast and accurate estimations for complex queries are profoundly beneficial for large databases with...
In many database queries relations are access multiple times during query processing. In these cases...
Join optimization is one of the most challenging tasks in query processing. The perfor-mance of join...
We apply the one shot semi-join execution strategy, which allows parallelism and processes multiple ...
Summarization: An emerging challenge in modern distributed querying is to effi- ciently process mult...
Advanced Data Mining applications require more and more support from relational database engines. Es...
Abstract. Due to high data volumes and unpredictable arrival rates, continuous query systems process...
Today’s data deluge enables organizations to collect massive data, and analyze it with an ever-incre...