Efficient Sparse Clustering of High-Dimensional Non-spherical Gaussian Mixtures

Martin Azizyan
Aarti Singh
Larry Wasserman

Publication date

January 2014

Abstract

We consider the problem of clustering data points in high dimensions, i.e. when the number of data points may be much smaller than the number of dimensions. Specifically, we consider a Gaussian mixture model (GMM) with non-spherical Gaussian components, where the clusters are distinguished by only a few relevant dimensions. The method we propose is a combination of a recent approach for learning parameters of a Gaus-sian mixture model and sparse linear discriminant analysis (LDA). In addition to cluster assignments, the method returns an estimate of the set of features relevant for clustering. Our results indicate that the sample complexity of clustering depends on the sparsity of the relevant feature set, while only scaling logarithmically...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Efficient Sparse Clustering of High-Dimensional Non-spherical Gaussian Mixtures

Abstract

Extracted data

Efficient Sparse Clustering of High-Dimensional Non-spherical Gaussian Mixtures

Abstract

Extracted data

Related items

Related items