In data streaming we work with large data from multiple sources. We observe overloaded partitions due to simple hash partitioning. We use partitioning algorithms that require a sampling algorithm to estimate the distribution of the stream. There is no standardized way to test these algorithms. We offer a framework with benchmark and hyperparameter optimization. Our work includes a generator for bursts and concept drifts. We provide algorithms that react to concept drifts
Abstract. We explore the possibilities of meta-learning on data streams, in particular algorithm sel...
With the development of computing systems in every sector of activity, more and more data is now ava...
The need for scalable and efficient stream analysis has led to the development of many open-source s...
In data streaming we work with large data from multiple sources. We observe overloaded partitions du...
Streaming data analysis has recently attracted at-tention in numerous applications including telepho...
We give a survey at some algorithmic techniques for processing data streams. After covering the basi...
Massive data sets are increasingly important in a wide range of applications, including observationa...
One response to the proliferation of large datasets has been to develop ingenious ways to throw reso...
Complex queries over high speed data streams often need to rely on approximations to keep up with th...
We introduce an alternative to reservoir sampling, a classic and popular algorithm for drawing a fix...
Data stream processing has gained increasing popularity in the last few years as an effective paradi...
Streaming is an important paradigm for handling high-speed data sets that are too large to fit in ma...
Abstract –We consider estimation of arbitrary range partitioning of data values and ranking of frequ...
Streaming algorithms must process a large quantity of small updates quickly to allow queries about t...
We explore the possibilities of meta-learning on data streams, in particular algorithm selection. In...
Abstract. We explore the possibilities of meta-learning on data streams, in particular algorithm sel...
With the development of computing systems in every sector of activity, more and more data is now ava...
The need for scalable and efficient stream analysis has led to the development of many open-source s...
In data streaming we work with large data from multiple sources. We observe overloaded partitions du...
Streaming data analysis has recently attracted at-tention in numerous applications including telepho...
We give a survey at some algorithmic techniques for processing data streams. After covering the basi...
Massive data sets are increasingly important in a wide range of applications, including observationa...
One response to the proliferation of large datasets has been to develop ingenious ways to throw reso...
Complex queries over high speed data streams often need to rely on approximations to keep up with th...
We introduce an alternative to reservoir sampling, a classic and popular algorithm for drawing a fix...
Data stream processing has gained increasing popularity in the last few years as an effective paradi...
Streaming is an important paradigm for handling high-speed data sets that are too large to fit in ma...
Abstract –We consider estimation of arbitrary range partitioning of data values and ranking of frequ...
Streaming algorithms must process a large quantity of small updates quickly to allow queries about t...
We explore the possibilities of meta-learning on data streams, in particular algorithm selection. In...
Abstract. We explore the possibilities of meta-learning on data streams, in particular algorithm sel...
With the development of computing systems in every sector of activity, more and more data is now ava...
The need for scalable and efficient stream analysis has led to the development of many open-source s...