Streaming is an important paradigm for handling high-speed data sets that are too large to fit in main memory. Prior work in data streams has shown how to estimate simple statistical parameters, such as histograms, heavy hitters, frequent moments, etc., on data streams. This dissertation focuses on a number of more sophisticated statistical analyses that are performed in near real-time, using limited resources. First, we present how to model stream data parametrically; in particular, we fit hierarchical (binomial multifractal) and non-hierarchical (Pareto) power-law models on a data stream. It yields algorithms that are fast, space-efficient, and provide accuracy guarantees. We also design fast methods to perform online model validation at ...
Abstract. Discovering frequent patterns over event sequences is an important data mining problem. Ex...
Streaming data analysis has recently attracted at-tention in numerous applications including telepho...
Network analysis and graph mining play a prominent role in providing insights and studying phenomena...
Data stream applications have made use of statistical summaries to reason about the data using nonpa...
Thesis (Ph. D.)--University of Rochester. Dept. of Mathematics, 2008.The algorithmic field of Data S...
Data stream processing has gained increasing popularity in the last few years as an effective paradi...
Streaming algorithms, which process very large datasets received one update at a time, are a key too...
A data stream is a transiently observed sequence of data elements that arrive unordered, with repeti...
Data streams are ubiquitous. Examples include the network traffic flowing past a router, data genera...
Streaming data is becoming more prevalent in our society every day. With the increasing use of techn...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
High-volume data streams are too large and grow too quickly to store entirely in working memory, int...
Using entropy of traffic distributions has been shown to aid a wide variety of network monitoring ap...
International audienceWe investigate the problem of estimating on the fly the frequency at which ite...
Exact solutions are unattainable for important problems. The calculations are limited by the memory ...
Abstract. Discovering frequent patterns over event sequences is an important data mining problem. Ex...
Streaming data analysis has recently attracted at-tention in numerous applications including telepho...
Network analysis and graph mining play a prominent role in providing insights and studying phenomena...
Data stream applications have made use of statistical summaries to reason about the data using nonpa...
Thesis (Ph. D.)--University of Rochester. Dept. of Mathematics, 2008.The algorithmic field of Data S...
Data stream processing has gained increasing popularity in the last few years as an effective paradi...
Streaming algorithms, which process very large datasets received one update at a time, are a key too...
A data stream is a transiently observed sequence of data elements that arrive unordered, with repeti...
Data streams are ubiquitous. Examples include the network traffic flowing past a router, data genera...
Streaming data is becoming more prevalent in our society every day. With the increasing use of techn...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
High-volume data streams are too large and grow too quickly to store entirely in working memory, int...
Using entropy of traffic distributions has been shown to aid a wide variety of network monitoring ap...
International audienceWe investigate the problem of estimating on the fly the frequency at which ite...
Exact solutions are unattainable for important problems. The calculations are limited by the memory ...
Abstract. Discovering frequent patterns over event sequences is an important data mining problem. Ex...
Streaming data analysis has recently attracted at-tention in numerous applications including telepho...
Network analysis and graph mining play a prominent role in providing insights and studying phenomena...