AbstractWe present a multiple pass streaming algorithm for learning the density function of a mixture of k uniform distributions over rectangles in Rd, for any d>0. Our learning model is: samples drawn according to the mixture are placed in arbitrary order in a data stream that may only be accessed sequentially by an algorithm with a very limited random access memory space. Our algorithm makes 2ℓ+2 passes, for any ℓ>0, and requires memory at most Õ(ϵ−2/ℓk2d4+(4k)d), where ϵ is the tolerable error of the algorithm. This exhibits a strong memory-pass tradeoff in terms of ϵ: a few more passes significantly lower its memory requirements, thus trading one of the two most important resources in streaming computation for the other. Chang and Kann...
We give an algorithm for learning a mixture of unstructured distributions. This problem arises in va...
We consider the problem of learning mixtures of product distributions over discrete domains in the d...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
We present a multiple pass streaming algorithm for learning the density function of a mixture of $k...
AbstractWe present a multiple pass streaming algorithm for learning the density function of a mixtur...
We present multiple pass streaming algorithms for a basic clustering problem for massive data sets. ...
We present multiple pass streaming algorithms for a basic clustering problem for massive data sets. ...
Let C be a class of probability distributions over the discrete domain [n] = {1,..., n}. We show th...
In supervised learning, availability of sufficient labeled data is of prime importance. Unfortunatel...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
We consider PAC learning of probability distributions (a.k.a. density estimation), where we are give...
We give an algorithm for learning a mixture of unstructured distributions. This problem arises in va...
We give an algorithm for learning a mixture of unstructured distributions. This problem arises in va...
We give an algorithm for learning a mixture of unstructured distributions. This problem arises in va...
We consider the problem of learning mixtures of product distributions over discrete domains in the d...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
We present a multiple pass streaming algorithm for learning the density function of a mixture of $k...
AbstractWe present a multiple pass streaming algorithm for learning the density function of a mixtur...
We present multiple pass streaming algorithms for a basic clustering problem for massive data sets. ...
We present multiple pass streaming algorithms for a basic clustering problem for massive data sets. ...
Let C be a class of probability distributions over the discrete domain [n] = {1,..., n}. We show th...
In supervised learning, availability of sufficient labeled data is of prime importance. Unfortunatel...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...
We consider PAC learning of probability distributions (a.k.a. density estimation), where we are give...
We give an algorithm for learning a mixture of unstructured distributions. This problem arises in va...
We give an algorithm for learning a mixture of unstructured distributions. This problem arises in va...
We give an algorithm for learning a mixture of unstructured distributions. This problem arises in va...
We consider the problem of learning mixtures of product distributions over discrete domains in the d...
In this dissertation, we make progress on certain algorithmic problems broadly over two computationa...