In modern applications, it is a big challenge that analyzing the order statistics about the most recent parts of the high-volume and high velocity stream data. There are some online quantile algorithms that can keep the sketch of the data in the sliding window and they can answer the quantile or rank query in a very short time. But most of them take the GK algorithm as the subroutine, which is not known to be mergeable. In this paper, we propose another algorithm to keep the sketch that maintains the order statistics over sliding windows. For the fixed-size window, the existing algorithms can’t maintain the correctness in the process of updating the sliding window. Our algorithm not only can maintain the correctness but also can achieve sim...
While traditional data-management systems focus on evaluating single, ad-hoc queries over static dat...
Computing aggregates over windows is at the core of virtually every stream processing job. Typical s...
The fast evolution of data analytics platforms has resulted in an increasing demand for real-time da...
In modern applications, it is a big challenge that analyzing the order statistics about the most rec...
Statistics computation over data streams is often required by many applications, including processin...
Sliding windows are bounded sets which evolve together with an infinite data stream of records. Each...
\u3cp\u3eWhile traditional data management systems focus on evaluating single, ad hoc queries over s...
In this paper we extend the study of algorithms for monitoring distributed data streams from whole d...
In many online applications, we need to maintain quantile statistics for a sliding window on a data ...
Continuous queries applied over nonterminating data streams usually specify windows in order to obta...
Window aggregation is a core operation in data stream processing. Existing aggregation techniques fo...
This paper presents algorithms for estimating aggregate functions over a "sliding window"...
High-volume data streams are too large and grow too quickly to store entirely in working memory, int...
This paper presents algorithms for estimating aggregate functions over a “sliding window ” of the N ...
International audienceEstimating the frequency of any piece of information in large-scale distribute...
While traditional data-management systems focus on evaluating single, ad-hoc queries over static dat...
Computing aggregates over windows is at the core of virtually every stream processing job. Typical s...
The fast evolution of data analytics platforms has resulted in an increasing demand for real-time da...
In modern applications, it is a big challenge that analyzing the order statistics about the most rec...
Statistics computation over data streams is often required by many applications, including processin...
Sliding windows are bounded sets which evolve together with an infinite data stream of records. Each...
\u3cp\u3eWhile traditional data management systems focus on evaluating single, ad hoc queries over s...
In this paper we extend the study of algorithms for monitoring distributed data streams from whole d...
In many online applications, we need to maintain quantile statistics for a sliding window on a data ...
Continuous queries applied over nonterminating data streams usually specify windows in order to obta...
Window aggregation is a core operation in data stream processing. Existing aggregation techniques fo...
This paper presents algorithms for estimating aggregate functions over a "sliding window"...
High-volume data streams are too large and grow too quickly to store entirely in working memory, int...
This paper presents algorithms for estimating aggregate functions over a “sliding window ” of the N ...
International audienceEstimating the frequency of any piece of information in large-scale distribute...
While traditional data-management systems focus on evaluating single, ad-hoc queries over static dat...
Computing aggregates over windows is at the core of virtually every stream processing job. Typical s...
The fast evolution of data analytics platforms has resulted in an increasing demand for real-time da...