We study the problem of learning nonparametric distributions in a finite mixture, and establish tight bounds on the sample complexity for learning the component distributions in such models. Namely, we are given i.i.d. samples from a pdf $f$ where $$ f=\sum_{i=1}^k w_i f_i, \quad\sum_{i=1}^k w_i=1, \quad w_i>0 $$ and we are interested in learning each component $f_i$. Without any assumptions on $f_i$, this problem is ill-posed. In order to identify the components $f_i$, we assume that each $f_i$ can be written as a convolution of a Gaussian and a compactly supported density $\nu_i$ with $\text{supp}(\nu_i)\cap \text{supp}(\nu_j)=\emptyset$. Our main result shows that $(\frac{1}{\varepsilon})^{\Omega(\log\log \frac{1}{\varepsilon})}$ sampl...
We provide an algorithm for properly learning mixtures of two single-dimensional Gaussians with-out ...
We study the problem of learning a distribution from samples, when the underlying distribution is a ...
Given data drawn from a mixture of multivariate Gaussians, a basic problem is to accurately estimate...
We study the problem of learning nonparametric distributions in a finite mixture, and establish tigh...
We study the problem of learning from unlabeled samples very general statistical mixture models on l...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
We consider PAC learning of probability distributions (a.k.a. density estimation), where we are give...
We study the problem of learning mixtures of linear classifiers under Gaussian covariates. Given sam...
We consider the problem of learning mixtures of product distributions over discrete domains in the d...
We give an algorithm for learning a mixture of unstructured distributions. This problem arises in va...
Known algorithms for learning PDFA can only be shown to run in time polynomial in the so-called dis...
Mixture models form one of the most fundamental classes of generative models for clustered data...
In this paper we show that very large mixtures of Gaussians are efficiently learnable in high dimens...
Statistical learning with missing or hidden information is ubiquitous in many practical problems. Fo...
We provide guarantees for learning latent variable models emphasizing on the overcomplete regime, wh...
We provide an algorithm for properly learning mixtures of two single-dimensional Gaussians with-out ...
We study the problem of learning a distribution from samples, when the underlying distribution is a ...
Given data drawn from a mixture of multivariate Gaussians, a basic problem is to accurately estimate...
We study the problem of learning nonparametric distributions in a finite mixture, and establish tigh...
We study the problem of learning from unlabeled samples very general statistical mixture models on l...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
We consider PAC learning of probability distributions (a.k.a. density estimation), where we are give...
We study the problem of learning mixtures of linear classifiers under Gaussian covariates. Given sam...
We consider the problem of learning mixtures of product distributions over discrete domains in the d...
We give an algorithm for learning a mixture of unstructured distributions. This problem arises in va...
Known algorithms for learning PDFA can only be shown to run in time polynomial in the so-called dis...
Mixture models form one of the most fundamental classes of generative models for clustered data...
In this paper we show that very large mixtures of Gaussians are efficiently learnable in high dimens...
Statistical learning with missing or hidden information is ubiquitous in many practical problems. Fo...
We provide guarantees for learning latent variable models emphasizing on the overcomplete regime, wh...
We provide an algorithm for properly learning mixtures of two single-dimensional Gaussians with-out ...
We study the problem of learning a distribution from samples, when the underlying distribution is a ...
Given data drawn from a mixture of multivariate Gaussians, a basic problem is to accurately estimate...