Continuous applications such as device monitoring and anomaly detection often require real-time aggregated statistics over unbounded data streams. While existing stream processing systems such as Flink, Spark, and Storm support processing of streaming aggregations, their optimizations are limited with respect to the dynamic nature of the data, and therefore are suboptimal when the workload changes and/or when there is data skew. In this paper we present AdCom, which is an adaptive combiner for stream processing engines. The use of AdCom in aggregation queries enables pre-aggregating tuples upstream (i.e., before data shuffling) followed by global aggregation downstream. In contrast to existing approaches, AdCom can automatically adjust the ...
Many algorithms have been proposed to approximate holistic aggregates, such as quantiles and heavy h...
We briefly describe our study on the problem of streaming multiway aggregation [5], where large data...
High throughput stream aggregation is essential for many applications that analyze massive volumes o...
Continuous applications such as device monitoring and anomaly detection often require real-time aggr...
In many data gathering applications, information arrives in the form of continuous streams rather th...
We present JetStream, a system that allows real-time analysis of large, widely-distributed changing ...
In this paper, we study the problem of streaming multiway aggre-gation, where large data volumes are...
Abstract. Monitoring aggregates on IP traffic data streams is a compelling appli-cation for data str...
Window aggregation is a core operation in data stream processing. Existing aggregation techniques fo...
High throughput and low latency stream aggregation - and stream processing in general - is critical ...
Stream processing is gaining importance as more data becomes available in the form of continuous str...
Many applications must ingest rapid streams of data and produce analytics results in near-real-...
High performance stream aggregation is critical for many emerging applications that analyze massive ...
We are in an era of big data, sensors, and monitoring technology. One consequence of this technology...
Abstract: The study on streaming data is one of the hot topics among the database circle all over th...
Many algorithms have been proposed to approximate holistic aggregates, such as quantiles and heavy h...
We briefly describe our study on the problem of streaming multiway aggregation [5], where large data...
High throughput stream aggregation is essential for many applications that analyze massive volumes o...
Continuous applications such as device monitoring and anomaly detection often require real-time aggr...
In many data gathering applications, information arrives in the form of continuous streams rather th...
We present JetStream, a system that allows real-time analysis of large, widely-distributed changing ...
In this paper, we study the problem of streaming multiway aggre-gation, where large data volumes are...
Abstract. Monitoring aggregates on IP traffic data streams is a compelling appli-cation for data str...
Window aggregation is a core operation in data stream processing. Existing aggregation techniques fo...
High throughput and low latency stream aggregation - and stream processing in general - is critical ...
Stream processing is gaining importance as more data becomes available in the form of continuous str...
Many applications must ingest rapid streams of data and produce analytics results in near-real-...
High performance stream aggregation is critical for many emerging applications that analyze massive ...
We are in an era of big data, sensors, and monitoring technology. One consequence of this technology...
Abstract: The study on streaming data is one of the hot topics among the database circle all over th...
Many algorithms have been proposed to approximate holistic aggregates, such as quantiles and heavy h...
We briefly describe our study on the problem of streaming multiway aggregation [5], where large data...
High throughput stream aggregation is essential for many applications that analyze massive volumes o...