We study the problem of learning mixtures of linear classifiers under Gaussian covariates. Given sample access to a mixture of $r$ distributions on $\mathbb{R}^n$ of the form $(\mathbf{x},y_{\ell})$, $\ell\in [r]$, where $\mathbf{x}\sim\mathcal{N}(0,\mathbf{I}_n)$ and $y_\ell=\mathrm{sign}(\langle\mathbf{v}_\ell,\mathbf{x}\rangle)$ for an unknown unit vector $\mathbf{v}_\ell$, the goal is to learn the underlying distribution in total variation distance. Our main result is a Statistical Query (SQ) lower bound suggesting that known algorithms for this problem are essentially best possible, even for the special case of uniform mixtures. In particular, we show that the complexity of any SQ algorithm for the problem is $n^{\mathrm{poly}(1/\Delta...
We consider PAC learning of probability distributions (a.k.a. density estimation), where we are give...
Given data drawn from a mixture of multivariate Gaussians, a basic problem is to accurately estimate...
In this paper we show that very large mixtures of Gaussians are efficiently learnable in high dimens...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Presented on September 18, 2017 at 11:00 a.m. in the Klaus Advanced Computing Building, Room 1116E.I...
We provide an algorithm for properly learning mixtures of two single-dimensional Gaussians with-out ...
Modeling with mixtures is a powerful method in the statistical toolkit that can be used for represen...
We consider the problem of learning mixtures of product distributions over discrete domains in the d...
Mixture models form one of the most fundamental classes of generative models for clustered data...
We consider the problem of identifying the parameters of an unknown mixture of two ar-bitrary d-dime...
AbstractWe show that a simple spectral algorithm for learning a mixture of k spherical Gaussians in ...
We establish optimal Statistical Query (SQ) lower bounds for robustly learning certain families of d...
AbstractWe prove two lower bounds in the statistical query (SQ) learning model. The first lower boun...
We study the problem of learning nonparametric distributions in a finite mixture, and establish tigh...
Statistical learning with missing or hidden information is ubiquitous in many practical problems. Fo...
We consider PAC learning of probability distributions (a.k.a. density estimation), where we are give...
Given data drawn from a mixture of multivariate Gaussians, a basic problem is to accurately estimate...
In this paper we show that very large mixtures of Gaussians are efficiently learnable in high dimens...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Presented on September 18, 2017 at 11:00 a.m. in the Klaus Advanced Computing Building, Room 1116E.I...
We provide an algorithm for properly learning mixtures of two single-dimensional Gaussians with-out ...
Modeling with mixtures is a powerful method in the statistical toolkit that can be used for represen...
We consider the problem of learning mixtures of product distributions over discrete domains in the d...
Mixture models form one of the most fundamental classes of generative models for clustered data...
We consider the problem of identifying the parameters of an unknown mixture of two ar-bitrary d-dime...
AbstractWe show that a simple spectral algorithm for learning a mixture of k spherical Gaussians in ...
We establish optimal Statistical Query (SQ) lower bounds for robustly learning certain families of d...
AbstractWe prove two lower bounds in the statistical query (SQ) learning model. The first lower boun...
We study the problem of learning nonparametric distributions in a finite mixture, and establish tigh...
Statistical learning with missing or hidden information is ubiquitous in many practical problems. Fo...
We consider PAC learning of probability distributions (a.k.a. density estimation), where we are give...
Given data drawn from a mixture of multivariate Gaussians, a basic problem is to accurately estimate...
In this paper we show that very large mixtures of Gaussians are efficiently learnable in high dimens...