peer-reviewedDatabase sampling has become a popular approach to handle large amounts of data in a wide range of application areas such as data mining or approximate query evaluation. Using database samples is a potential solution when using the entire database is not cost-e ective, and a balance between the accuracy of the results and the computational cost of the process applied on the large data set is preferred. Existing sampling approaches are either limited to speci c application areas, to single table databases, or to random sampling. In this paper, we propose CoDS: a novel sampling approach targeting relational databases that ensures that the sample database follows the same distribution for specific fields as the original ...
Although approximate query processing is a prominent way to cope with the requirements of data analy...
Decision support queries usually involve accessing enormous amount of data requiring significant ret...
Abstract. Random sampling is a popular technique for providing fast approximate query answers, espec...
peer-reviewedManaging large amounts of information is one of the most expensive, time-consuming and...
Managing large amounts of information is one of the most expensive, time-consuming and non-trivial a...
peer-reviewedPopulating the testing environment with relevant data represents a great challenge in ...
Abstract—In a wide range of application areas (e.g. data mining, approximate query evaluation, histo...
AbstractRecently, we have proposed an adaptive, random-sampling algorithm for general query size est...
peer-reviewedIn a wide range of application areas (e.g. data mining, approximate query evaluation, ...
Database sampling is widely used in many database ap-plications when, for efficiency reasons, an ent...
Database sampling is widely used in many database applications when, for eciency reasons, an entire ...
Data mining is an emerging research area, whose goal is to extract significant patterns or interesti...
Random sampling is a popular technique for providing fast approximate query answers, especially in d...
In the wake of growing database that has already become the trend of today’s business environment wi...
Abstract. Random sampling is a popular technique for providing fast approximate query answers, espec...
Although approximate query processing is a prominent way to cope with the requirements of data analy...
Decision support queries usually involve accessing enormous amount of data requiring significant ret...
Abstract. Random sampling is a popular technique for providing fast approximate query answers, espec...
peer-reviewedManaging large amounts of information is one of the most expensive, time-consuming and...
Managing large amounts of information is one of the most expensive, time-consuming and non-trivial a...
peer-reviewedPopulating the testing environment with relevant data represents a great challenge in ...
Abstract—In a wide range of application areas (e.g. data mining, approximate query evaluation, histo...
AbstractRecently, we have proposed an adaptive, random-sampling algorithm for general query size est...
peer-reviewedIn a wide range of application areas (e.g. data mining, approximate query evaluation, ...
Database sampling is widely used in many database ap-plications when, for efficiency reasons, an ent...
Database sampling is widely used in many database applications when, for eciency reasons, an entire ...
Data mining is an emerging research area, whose goal is to extract significant patterns or interesti...
Random sampling is a popular technique for providing fast approximate query answers, especially in d...
In the wake of growing database that has already become the trend of today’s business environment wi...
Abstract. Random sampling is a popular technique for providing fast approximate query answers, espec...
Although approximate query processing is a prominent way to cope with the requirements of data analy...
Decision support queries usually involve accessing enormous amount of data requiring significant ret...
Abstract. Random sampling is a popular technique for providing fast approximate query answers, espec...