How can we train a statistical mixture model on a massive data set? In this work we show how to construct coresets for mixtures of Gaussians. A coreset is a weighted subset of the data, which guarantees that models fitting the coreset also provide a good fit for the original data set. We show that, perhaps surprisingly, Gaussian mixtures admit coresets of size polynomial in dimension and the number of mixture components, while being independent of the data set size. Hence, one can harness computationally intensive algorithms to compute a good approximation on a significantly smaller data set. More importantly, such coresets can be efficiently constructed both in distributed and streaming settings and do not impose restrictions on the data g...
Abstract: In the domain of unsupervised learning, mixtures of Gaussians have become a popular tool f...
<p>While several papers have investigated computationally and statistically efficient methods for le...
Abstract. Gaussian mixture models are a widespread tool for mod-eling various and complex probabilit...
Abstract How can we train a statistical mixture model on a massive data set? In this paper, we show ...
How can we train a statistical mixture model on a massive data set? In this work we show how to cons...
Given data drawn from a mixture of multivariate Gaussians, a basic problem is to accurately estimate...
Mixtures of gaussian (or normal) distributions arise in a variety of application areas. Many techniq...
In this paper we show that very large mixtures of Gaussians are efficiently learnable in high dimens...
University of AmsterdamWe present a deterministic greedy method to learn a mixture of Gaussians whic...
For every epsilon > 0, we give an efficient algorithm to learn the cluster centers of a mixture of p...
We provide an algorithm for properly learning mixtures of two single-dimensional Gaussians with-out ...
Mixtures of Gaussian (or normal) distributions arise in a variety of application areas. Many heurist...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Meinicke P, Ritter H. Resolution-based complexity control for gaussian mixture models. Neural Comput...
Generalised linear models for multi-class classification problems are one of the fundamental buildin...
Abstract: In the domain of unsupervised learning, mixtures of Gaussians have become a popular tool f...
<p>While several papers have investigated computationally and statistically efficient methods for le...
Abstract. Gaussian mixture models are a widespread tool for mod-eling various and complex probabilit...
Abstract How can we train a statistical mixture model on a massive data set? In this paper, we show ...
How can we train a statistical mixture model on a massive data set? In this work we show how to cons...
Given data drawn from a mixture of multivariate Gaussians, a basic problem is to accurately estimate...
Mixtures of gaussian (or normal) distributions arise in a variety of application areas. Many techniq...
In this paper we show that very large mixtures of Gaussians are efficiently learnable in high dimens...
University of AmsterdamWe present a deterministic greedy method to learn a mixture of Gaussians whic...
For every epsilon > 0, we give an efficient algorithm to learn the cluster centers of a mixture of p...
We provide an algorithm for properly learning mixtures of two single-dimensional Gaussians with-out ...
Mixtures of Gaussian (or normal) distributions arise in a variety of application areas. Many heurist...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Meinicke P, Ritter H. Resolution-based complexity control for gaussian mixture models. Neural Comput...
Generalised linear models for multi-class classification problems are one of the fundamental buildin...
Abstract: In the domain of unsupervised learning, mixtures of Gaussians have become a popular tool f...
<p>While several papers have investigated computationally and statistically efficient methods for le...
Abstract. Gaussian mixture models are a widespread tool for mod-eling various and complex probabilit...