We introduce a predictive modeling solution that provides high quality predictive analytics over aggregation queries in Big Data environments. Our predictive methodology is generally applicable in environments in which large-scale data owners may or may not restrict access to their data and allow only aggregation operators like COUNT to be executed over their data. In this context, our methodology is based on historical queries and their answers to accurately predict ad-hoc queries’ answers. We focus on the widely used set-cardinality, i.e., COUNT, aggregation query, as COUNT is a fundamental operator for both internal data system optimizations and for aggregation-oriented data exploration and predictive analytics. We contribute a novel, qu...
Regression analytics has been the standard approach to modeling the relationship between input and o...
Recent trends aim to incorporate advanced data analytics capabilities within DBMSs. Linear regressio...
In the Big Data era, there is a resurgence of interest in using Datalog to express data analysis app...
We introduce a predictive modeling solution that provides high quality predictive analytics over agg...
We study a novel solution to executing aggregation (and specifically COUNT) queries over large-scale...
We study a novel solution to executing aggregation (and specifically COUNT) queries over large-scal...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Fundamental to many predictive analytics tasks is the ability to estimate the cardinality (number of...
Nowadays, the increased amount of users' devices produce huge volumes of data that should be efficie...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Distance-based nearest neighbours (dNN) queries and aggregations over their answer sets are importan...
Distance-based nearest neighbours (dNN) queries and aggregations over their answer sets are importan...
Lack of knowledge in the underlying data distribution in distributed large-scale data can be an obst...
As the era of big data continues to evolve, the need for efficient and effective query processing in...
Regression analytics has been the standard approach to modeling the relationship between input and o...
Recent trends aim to incorporate advanced data analytics capabilities within DBMSs. Linear regressio...
In the Big Data era, there is a resurgence of interest in using Datalog to express data analysis app...
We introduce a predictive modeling solution that provides high quality predictive analytics over agg...
We study a novel solution to executing aggregation (and specifically COUNT) queries over large-scale...
We study a novel solution to executing aggregation (and specifically COUNT) queries over large-scal...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Fundamental to many predictive analytics tasks is the ability to estimate the cardinality (number of...
Nowadays, the increased amount of users' devices produce huge volumes of data that should be efficie...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Distance-based nearest neighbours (dNN) queries and aggregations over their answer sets are importan...
Distance-based nearest neighbours (dNN) queries and aggregations over their answer sets are importan...
Lack of knowledge in the underlying data distribution in distributed large-scale data can be an obst...
As the era of big data continues to evolve, the need for efficient and effective query processing in...
Regression analytics has been the standard approach to modeling the relationship between input and o...
Recent trends aim to incorporate advanced data analytics capabilities within DBMSs. Linear regressio...
In the Big Data era, there is a resurgence of interest in using Datalog to express data analysis app...