Fast and accurate estimates for complex queries are highly beneficial for large databases with heavy workloads. The most widely adopted query optimizers use synopses to tune databases through optimization and re-optimization. Chapters 1 through 3 focus on synopses for query optimization. We propose a statistical summary for a database, called CS2 (Correlated Sample Synopsis), that provides rapid and accurate result size estimates for all queries with joins and arbitrary selections. Unlike state-of-the-art techniques, CS2 does not rely entirely on simple random samples; it consists mainly of correlated sample tuples that retain join relationships while using less storage. We introduce a statistical technique...
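The idea of a correlated sample that retains join relationships can be illustrated with a minimal sketch. This is not the CS2 construction or its estimator, which the abstract does not spell out; it is a simplified two-relation illustration under assumed conditions: one relation (the "source") is sampled uniformly at a given rate, every tuple of the other relation that joins with a sampled source tuple is kept, and the join size is estimated by scaling the match count by the sampling rate. The function names, parameters, and dictionary-based rows are all hypothetical.

```python
import random
from collections import defaultdict


def correlated_sample(source, target, key_src, key_tgt, rate, seed=0):
    """Illustrative only (hypothetical helper, not the CS2 algorithm).

    Draw a Bernoulli sample of `source` at `rate`, then keep every
    `target` row whose join key matches a sampled source row, so the
    join relationship of each sampled tuple is preserved.
    """
    rng = random.Random(seed)
    sampled_source = [row for row in source if rng.random() < rate]

    # Index the target relation by join key.
    index = defaultdict(list)
    for row in target:
        index[row[key_tgt]].append(row)

    # Keep only target rows that join with some sampled source row.
    sampled_keys = {row[key_src] for row in sampled_source}
    sampled_target = [row for k in sampled_keys for row in index.get(k, [])]
    return sampled_source, sampled_target


def estimate_join_size(sampled_source, sampled_target, key_src, key_tgt,
                       rate, pred_src=lambda r: True, pred_tgt=lambda r: True):
    """Scale the number of join matches found in the sample by 1/rate.

    `pred_src` and `pred_tgt` are optional selection predicates applied
    to each side before counting matches.
    """
    counts = {}
    for row in sampled_target:
        if pred_tgt(row):
            counts[row[key_tgt]] = counts.get(row[key_tgt], 0) + 1

    matches = sum(counts.get(row[key_src], 0)
                  for row in sampled_source if pred_src(row))
    return matches / rate
```

Because the target rows are chosen by following the join key from the sampled source rows, the sample never contains a target row that cannot join. With `rate=1.0` the estimate equals the exact join size, which gives a simple sanity check for the sketch.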
The use of business intelligence tools and other means to generate queries has led to great variety ...
Join ordering is one of the most important, but also most challenging problems of query optimization...
Most modern DBMS optimizers rely upon a cost model to choose the best query execution plan (QEP) for...
Thesis (Ph.D.)--University of Washington, 2021. As the demand for data intensive pipelines has grown a...
Recently, we have proposed an adaptive, random-sampling algorithm for general query size est...
The query processor of a relational database system executes declarative queries on relational data ...
Most modern applications deal with very large amounts of data. Having to deal with such huge amounts...
Finding a good join order is crucial for query performance. In this paper, we introduce the Join Ord...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
The ability to estimate resource consumption of SQL queries is crucial for a number of tasks in a d...
Uniform sampling of join orders is known to be a competitive alternative to transformati...
Testing an SQL database system by running large sets of deterministic or stochastic SQL ...
Optimization of join queries based on average selectivities is suboptimal in highly correlated datab...