Accurate selectivity estimations are essential for query optimization decisions where they are typically derived from various kinds of histograms which condense value distributions into compact representations. The estimation accuracy of existing approaches typically varies across the domain, with some estimations being very accurate and some quite inaccurate. This is in particular unfortunate when performing a parametric search using these estimations, as the estimation artifacts can dominate the search results. We propose the usage of linear splines to construct histograms with known error guarantees across the whole continuous domain. These histograms are particularly well suited for using the estimates in parameter optimization. We show...
Histograms and Wavelet synopses provide useful tools in query optimization and approximate query ans...
One of the most difficult tasks in modern day database management systems is information retrieval. ...
Histograms that guarantee a maximum multiplicative error (q-error) for estimates may significantly i...
Accurate selectivity estimations are essential for query optimization decisions where they are typic...
This paper aims to improve the accuracy of query result-size estimations in query optimizers by leve...
We have solved the following problem using pattern classification techniques (PCT): given two histog...
We have solved the following problem using Pattern Classijication Techniques (PCT): Given two histog...
Histogram techniques have been used in many commercial database management systems to estimate a que...
Histograms have long been used to capture attribute value distribution statistics for query optimize...
Histogram representation of a large set of data is a good way for summarizing and visualize data an...
Random sampling is a standard technique for constructing (approximate) histograms for query optimiza...
The database query optimizer requires the estimation of the query selectivity to find the most effic...
An assumption with self-tuning histograms has been that they can "learn" the dataset if given enough...
Data-driven research is often hampered by privacy restrictions in the form of limited datasets or gr...
An algorithm for an approximating function to the frequency distribution is obtained from a sample o...
Histograms and Wavelet synopses provide useful tools in query optimization and approximate query ans...
One of the most difficult tasks in modern day database management systems is information retrieval. ...
Histograms that guarantee a maximum multiplicative error (q-error) for estimates may significantly i...
Accurate selectivity estimations are essential for query optimization decisions where they are typic...
This paper aims to improve the accuracy of query result-size estimations in query optimizers by leve...
We have solved the following problem using pattern classification techniques (PCT): given two histog...
We have solved the following problem using Pattern Classijication Techniques (PCT): Given two histog...
Histogram techniques have been used in many commercial database management systems to estimate a que...
Histograms have long been used to capture attribute value distribution statistics for query optimize...
Histogram representation of a large set of data is a good way for summarizing and visualize data an...
Random sampling is a standard technique for constructing (approximate) histograms for query optimiza...
The database query optimizer requires the estimation of the query selectivity to find the most effic...
An assumption with self-tuning histograms has been that they can "learn" the dataset if given enough...
Data-driven research is often hampered by privacy restrictions in the form of limited datasets or gr...
An algorithm for an approximating function to the frequency distribution is obtained from a sample o...
Histograms and Wavelet synopses provide useful tools in query optimization and approximate query ans...
One of the most difficult tasks in modern day database management systems is information retrieval. ...
Histograms that guarantee a maximum multiplicative error (q-error) for estimates may significantly i...