We present multiple pass streaming algorithms for a basic clustering problem for massive data sets. If our algorithm is allotted 2l passes, it will produce an approximation with error at most ε using Õ(k3/ε2/l) bits of memory, the most critical resource for streaming computation. We demonstrate that this tradeoff between passes and memory allotted is intrinsic to the problem and model of computation by proving lower bounds on the memory requirements of any l pass randomized algorithm that are nearly matched by our upper bounds. To the best of our knowledge, this is the first time nearly matching bounds have been proved for such an exponential tradeoff for randomized computation.In this problem, we are given a set of n points drawn randomly ...
As data gathering grows easier, and as researchers discover new ways to interpret data, streaming-da...
Streaming algorithms, which process very large datasets received one update at a time, are a key too...
In this paper we investigate algorithms and lower bounds for summarization problems over a single ...
We present multiple pass streaming algorithms for a basic clustering problem for massive data sets. ...
Streaming data analysis has recently attracted at-tention in numerous applications including telepho...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
We study clustering under the data stream model of computation where: given a sequence of points, th...
The last decade witnessed the extensive studies of algorithms for data streams. In this model, the i...
The last decade witnessed the extensive studies of algorithms for data streams. In this model, the i...
AbstractWe present a multiple pass streaming algorithm for learning the density function of a mixtur...
In this paper we study the space requirement of algorithms that make only one (or a small number of)...
Exact solutions are unattainable for important problems. The calculations are limited by the memory ...
We present a multiple pass streaming algorithm for learning the density function of a mixture of $k...
As data gathering grows easier, and as researchers discover new ways to interpret data, streaming-da...
Streaming algorithms, which process very large datasets received one update at a time, are a key too...
In this paper we investigate algorithms and lower bounds for summarization problems over a single ...
We present multiple pass streaming algorithms for a basic clustering problem for massive data sets. ...
Streaming data analysis has recently attracted at-tention in numerous applications including telepho...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
We study clustering under the data stream model of computation where: given a sequence of points, th...
The last decade witnessed the extensive studies of algorithms for data streams. In this model, the i...
The last decade witnessed the extensive studies of algorithms for data streams. In this model, the i...
AbstractWe present a multiple pass streaming algorithm for learning the density function of a mixtur...
In this paper we study the space requirement of algorithms that make only one (or a small number of)...
Exact solutions are unattainable for important problems. The calculations are limited by the memory ...
We present a multiple pass streaming algorithm for learning the density function of a mixture of $k...
As data gathering grows easier, and as researchers discover new ways to interpret data, streaming-da...
Streaming algorithms, which process very large datasets received one update at a time, are a key too...
In this paper we investigate algorithms and lower bounds for summarization problems over a single ...