We present efficient parallel streaming algorithms for fundamental frequency-based aggregates in both the sliding window and the infinite window settings. In the sliding window setting, we give a parallel algorithm for maintaining a space-bounded block counter (SBBC). Using SBBC, we derive algorithms for basic counting, frequency estimation, and heavy hitters that perform no more work than their best sequential counterparts. In the infinite window setting, we present algorithms for frequency estimation, heavy hitters, and count-min sketch. For both the infinite window and sliding window settings, our parallel algorithms process a minibatch of items using linear work and polylog parallel depth. We also prove a lower bound showing that the ...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
The computation of sliding window aggregates is one of the core functionalities of stream processing...
IntroductionStreaming services are highly popular today. Millions of people watch live streams or vi...
We present efficient parallel streaming algorithms for fundamental frequency-based aggregates in bot...
This paper investigates parallel random sampling from a potentially-unending data stream whose eleme...
In this paper we present PFDCMSS (Parallel Forward Decay Count-Min Space Saving) which, to the best ...
Sketches are probabilistic data structures that can provide approx- imate results within mathematica...
The number of triangles in a graph is a fundamental metric widely used in social network analysis, l...
National audienceComputing aggregation over sliding windows, i.e., finite subsets of an unbounded st...
AbstractIn this paper we show how parallel algorithms can be turned into efficient streaming algorit...
International audienceWe investigate the problem of estimating on the fly the frequency at which ite...
Computing functions over a distributed stream of data is a significant problem with practical applic...
Streaming model supplies solutions for handling enormous data flows for over 20 years now. The mode...
Count queries belong to a class of summary statistics routinely used in basket analysis, inventory t...
This electronic version was submitted by the student author. The certified thesis is available in th...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
The computation of sliding window aggregates is one of the core functionalities of stream processing...
IntroductionStreaming services are highly popular today. Millions of people watch live streams or vi...
We present efficient parallel streaming algorithms for fundamental frequency-based aggregates in bot...
This paper investigates parallel random sampling from a potentially-unending data stream whose eleme...
In this paper we present PFDCMSS (Parallel Forward Decay Count-Min Space Saving) which, to the best ...
Sketches are probabilistic data structures that can provide approx- imate results within mathematica...
The number of triangles in a graph is a fundamental metric widely used in social network analysis, l...
National audienceComputing aggregation over sliding windows, i.e., finite subsets of an unbounded st...
AbstractIn this paper we show how parallel algorithms can be turned into efficient streaming algorit...
International audienceWe investigate the problem of estimating on the fly the frequency at which ite...
Computing functions over a distributed stream of data is a significant problem with practical applic...
Streaming model supplies solutions for handling enormous data flows for over 20 years now. The mode...
Count queries belong to a class of summary statistics routinely used in basket analysis, inventory t...
This electronic version was submitted by the student author. The certified thesis is available in th...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
The computation of sliding window aggregates is one of the core functionalities of stream processing...
IntroductionStreaming services are highly popular today. Millions of people watch live streams or vi...