Abstract. Random sampling is a popular technique for providing fast approximate query answers, especially in data warehouse environments. Compared to other types of synopses, random sampling bears the ad-vantage of retaining the dataset’s dimensionality; it also associates prob-abilistic error bounds with the query results. Most of the available sam-pling techniques focus on table-level sampling, that is, they produce a sample of only a single database table. Queries that contain joins over multiple tables cannot be answered with such samples because join re-sults on random samples are often small and skewed. On the contrary, schema-level sampling techniques by design support queries containing joins. In this paper, we introduce Linked Bern...
peer-reviewedDatabase sampling has become a popular approach to handle large amounts of data in a w...
With the amount of data in current data warehouse databases growing steadily, random sampling is con...
AbstractWe present an adaptive, random sampling algorithm for estimating the size of general queries...
Random sampling is a popular technique for providing fast approximate query answers, especially in d...
Abstract. Random sampling is a popular technique for providing fast approximate query answers, espec...
Approximate query processing is an adequate technique to reduce response times and system load in ca...
textabstractUniform sampling of join orders is known to be a competitive alternative to transformati...
Abstract. Uniform sampling of join orders is known to be a competitive alternative to transformation...
AbstractRecently, we have proposed an adaptive, random-sampling algorithm for general query size est...
Join queries are a fundamental database tool, capturing a range of tasks that involve linking hetero...
We compare the performance of sampling-based procedures for estimating the selectivity of a join. Wh...
AbstractWe compare the performance of sampling-based procedures for estimating the selectivity of a ...
Modern databases face formidable challenges when called to join (several) massive tables. Joins (esp...
Random sampling has become a crucial component of modern data management systems. Although the liter...
Fast and accurate estimations for complex queries are profoundly beneficial for large databases with...
peer-reviewedDatabase sampling has become a popular approach to handle large amounts of data in a w...
With the amount of data in current data warehouse databases growing steadily, random sampling is con...
AbstractWe present an adaptive, random sampling algorithm for estimating the size of general queries...
Random sampling is a popular technique for providing fast approximate query answers, especially in d...
Abstract. Random sampling is a popular technique for providing fast approximate query answers, espec...
Approximate query processing is an adequate technique to reduce response times and system load in ca...
textabstractUniform sampling of join orders is known to be a competitive alternative to transformati...
Abstract. Uniform sampling of join orders is known to be a competitive alternative to transformation...
AbstractRecently, we have proposed an adaptive, random-sampling algorithm for general query size est...
Join queries are a fundamental database tool, capturing a range of tasks that involve linking hetero...
We compare the performance of sampling-based procedures for estimating the selectivity of a join. Wh...
AbstractWe compare the performance of sampling-based procedures for estimating the selectivity of a ...
Modern databases face formidable challenges when called to join (several) massive tables. Joins (esp...
Random sampling has become a crucial component of modern data management systems. Although the liter...
Fast and accurate estimations for complex queries are profoundly beneficial for large databases with...
peer-reviewedDatabase sampling has become a popular approach to handle large amounts of data in a w...
With the amount of data in current data warehouse databases growing steadily, random sampling is con...
AbstractWe present an adaptive, random sampling algorithm for estimating the size of general queries...