We consider the problem of learning mixtures of distributions via spectral methods and derive a tight characterization of when such methods are useful. Specifically, given a mixture-sample, let i , C i , w i denote the empirical mean, covariance matrix, and mixing weight of the i-th component. We prove that a very simple algorithm, namely spectral projection followed by single-linkage clustering, properly classifies every point in the sample when each i is separated from all j by 2 (1/w i +1/w j ) plus a term that depends on the concentration properties of the distributions in the mixture. This second term is very small for many distributions, including Gaussians, Log-concave, and many others. As a result, we get the best ...
Abstract. This text is devoted to the asymptotic study of some spectral properties of the Gram matri...
This text is devoted to the asymptotic study of some spectral properties of the Gram matri...
Generalised linear models for multi-class classification problems are one of the fundamental buildin...
AbstractWe show that a simple spectral algorithm for learning a mixture of k spherical Gaussians in ...
Clustering analysis is a popular technique in statistics and machine learning. Despite its wide use,...
We study the problem of learning a distribution from samples, when the underlying distribution is a ...
International audienceGaussian Mixture Models are widely used nowadays, thanks to the simplicity and...
Given data drawn from a mixture of multivariate Gaussians, a basic problem is to accurately estimate...
Spectral clustering is one of the most popular algorithms to group high dimensional data. It is easy...
We provide an algorithm for properly learning mixtures of two single-dimensional Gaussians with-out ...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
<p>While several papers have investigated computationally and statistically efficient methods for le...
One of the main tasks in kernel methods is the selection of adequate mappings into higher dimension ...
In this article, we present a strategy for producing low-dimensional projections that maximally sepa...
In this paper we consider the problem of learning a mixture of permutations, where each component of...
Abstract. This text is devoted to the asymptotic study of some spectral properties of the Gram matri...
This text is devoted to the asymptotic study of some spectral properties of the Gram matri...
Generalised linear models for multi-class classification problems are one of the fundamental buildin...
AbstractWe show that a simple spectral algorithm for learning a mixture of k spherical Gaussians in ...
Clustering analysis is a popular technique in statistics and machine learning. Despite its wide use,...
We study the problem of learning a distribution from samples, when the underlying distribution is a ...
International audienceGaussian Mixture Models are widely used nowadays, thanks to the simplicity and...
Given data drawn from a mixture of multivariate Gaussians, a basic problem is to accurately estimate...
Spectral clustering is one of the most popular algorithms to group high dimensional data. It is easy...
We provide an algorithm for properly learning mixtures of two single-dimensional Gaussians with-out ...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
<p>While several papers have investigated computationally and statistically efficient methods for le...
One of the main tasks in kernel methods is the selection of adequate mappings into higher dimension ...
In this article, we present a strategy for producing low-dimensional projections that maximally sepa...
In this paper we consider the problem of learning a mixture of permutations, where each component of...
Abstract. This text is devoted to the asymptotic study of some spectral properties of the Gram matri...
This text is devoted to the asymptotic study of some spectral properties of the Gram matri...
Generalised linear models for multi-class classification problems are one of the fundamental buildin...