With the amount of data in current data warehouse databases growing steadily, random sampling is continuously gaining in importance. In particular, interactive analyses of large datasets can greatly benefit from the significantly shorter response times of approximate query processing. Typically, those analytical queries partition the data into groups and aggregate the values within the groups. Further, with the commonly used roll-up and drill-down operations a broad range of group-by queries is posed to the system, which makes the construction of highly-specialized synopses difficult. In this paper, we propose a general-purpose sampling scheme that is biased in order to answer group-by queries with high accuracy. While existing techniques f...
Approximate query processing based on random sampling is one of the most useful methods for the effi...
A large part of the data on the World Wide Web resides in the deep web. Executing structured, high-l...
A simple random sample of records from a large data warehouse may not contain sufficient number of r...
With the amount of data in current data warehouse databases growing steadily, random sampling is con...
With the amount of data in current data warehouse databases growing steadily, random sampling is con...
Approximate query processing is an adequate technique to reduce response times and system load in ca...
Random sampling has been widely used in approximate query processing on large databases, due to its ...
In Zeiten wachsender Datenbankgrößen ist es unumgänglich, Anfragen näherungsweise auszuwerten um sch...
In decision support applications, the ability to provide fast approximate answers to aggregation que...
Random sampling is a popular technique for providing fast approximate query answers, especially in d...
Although approximate query processing is a prominent way to cope with the requirements of data analy...
Decision support queries usually involve accessing enormous amount of data requiring significant ret...
In processing large quantities of data, a fundamental problem is to obtain a summary which supports ...
Abstract. Random sampling is a popular technique for providing fast approximate query answers, espec...
Many discovery problems, e.g., subgroup or association rule discovery, can naturally be cast as n-be...
Approximate query processing based on random sampling is one of the most useful methods for the effi...
A large part of the data on the World Wide Web resides in the deep web. Executing structured, high-l...
A simple random sample of records from a large data warehouse may not contain sufficient number of r...
With the amount of data in current data warehouse databases growing steadily, random sampling is con...
With the amount of data in current data warehouse databases growing steadily, random sampling is con...
Approximate query processing is an adequate technique to reduce response times and system load in ca...
Random sampling has been widely used in approximate query processing on large databases, due to its ...
In Zeiten wachsender Datenbankgrößen ist es unumgänglich, Anfragen näherungsweise auszuwerten um sch...
In decision support applications, the ability to provide fast approximate answers to aggregation que...
Random sampling is a popular technique for providing fast approximate query answers, especially in d...
Although approximate query processing is a prominent way to cope with the requirements of data analy...
Decision support queries usually involve accessing enormous amount of data requiring significant ret...
In processing large quantities of data, a fundamental problem is to obtain a summary which supports ...
Abstract. Random sampling is a popular technique for providing fast approximate query answers, espec...
Many discovery problems, e.g., subgroup or association rule discovery, can naturally be cast as n-be...
Approximate query processing based on random sampling is one of the most useful methods for the effi...
A large part of the data on the World Wide Web resides in the deep web. Executing structured, high-l...
A simple random sample of records from a large data warehouse may not contain sufficient number of r...