We study a novel solution to executing aggregation (and specifically COUNT) queries over large-scale data. The proposed solution is generally applicable, in the sense that it can be deployed in environments in which data owners may or may not restrict access to their data and allow only 'aggregation operators' to be executed over their data. For this, it is based on predictive analytics, driven by queries and their results. We propose a machine learning (ML) framework for the task (which can be adapted for different aggregates as well). We focus on the widely used set-cardinality (i.e., COUNT) aggregation operator, as it is a fundamental operator for both internal data system optimisations and for aggregation-query analytics. We contribute ...
The digitization of our lives cause a shift in the data production as well as in the required data m...
As the era of big data continues to evolve, the need for efficient and effective query processing in...
Recent trends aim to incorporate advanced data analytics capabilities within DBMSs. Linear regressio...
We introduce a predictive modeling solution that provides high quality predictive analytics over agg...
We introduce a predictive modeling solution that provides high quality predictive analytics over agg...
We study a novel solution to executing aggregation (and specifically COUNT) queries over large-scal...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Fundamental to many predictive analytics tasks is the ability to estimate the cardinality (number of...
Nowadays, the increased amount of users' devices produce huge volumes of data that should be efficie...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Distance-based nearest neighbours (dNN) queries and aggregations over their answer sets are importan...
Distance-based nearest neighbours (dNN) queries and aggregations over their answer sets are importan...
The digitization of our lives cause a shift in the data production as well as in the required data m...
Lack of knowledge in the underlying data distribution in distributed large-scale data can be an obst...
The digitization of our lives cause a shift in the data production as well as in the required data m...
As the era of big data continues to evolve, the need for efficient and effective query processing in...
Recent trends aim to incorporate advanced data analytics capabilities within DBMSs. Linear regressio...
We introduce a predictive modeling solution that provides high quality predictive analytics over agg...
We introduce a predictive modeling solution that provides high quality predictive analytics over agg...
We study a novel solution to executing aggregation (and specifically COUNT) queries over large-scal...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Fundamental to many predictive analytics tasks is the ability to estimate the cardinality (number of...
Nowadays, the increased amount of users' devices produce huge volumes of data that should be efficie...
Large organizations have seamlessly incorporated data-driven decision making in their operations. Ho...
Distance-based nearest neighbours (dNN) queries and aggregations over their answer sets are importan...
Distance-based nearest neighbours (dNN) queries and aggregations over their answer sets are importan...
The digitization of our lives cause a shift in the data production as well as in the required data m...
Lack of knowledge in the underlying data distribution in distributed large-scale data can be an obst...
The digitization of our lives cause a shift in the data production as well as in the required data m...
As the era of big data continues to evolve, the need for efficient and effective query processing in...
Recent trends aim to incorporate advanced data analytics capabilities within DBMSs. Linear regressio...