A mixture of Gaussians fit to a single curved or heavy-tailed cluster will report that the data contains many clusters. To produce more appropriate clusterings, we introduce a model which warps a latent mixture of Gaussians to produce nonparametric cluster shapes. The possibly low-dimensional latent mixture model allows us to summarize the properties of the high-dimensional clusters (or density manifolds) describing the data. The number of manifolds, as well as the shape and dimension of each manifold is automat-ically inferred. We derive a simple inference scheme for this model which analytically inte-grates out both the mixture parameters and the warping function. We show that our model is effective for density estimation, per-forms bette...
The density-based formulation aims at recasting the clustering problem to a mathematically sound fra...
A new clustering approach based on mode identification is developed by applying new optimization tec...
An interesting problem, not often faced under the Bayesian nonparametric framework, is the clusterin...
A mixture of Gaussians fit to a single curved or heavy-tailed cluster will report that the data cont...
A mixture of Gaussians fit to a single curved or heavy-tailed cluster will report that the data cont...
none2In this paper, we propose a method to group a set of probability density functions (pdfs) into ...
We consider the problem of clustering data points in high dimensions, i.e. when the number of data p...
We propose a novel method for multiple clustering, which is useful for analysis of high-dimensional ...
Latent variable models are used extensively in unsupervised learning within the Bayesian paradigm, t...
Variable selection is an important problem for cluster analysis of high-dimensional data. It is also...
We are interested in clustering data whose support is “curved”. Recently we have addressed this prob...
We are interested in clustering data whose support is “curved”. Recently we have addressed this prob...
A BSTRACT. We are interested in clustering data whose support is “curved”. Recently we have ad- dres...
We present a novel algorithm called PG-means which is able to learn the number of clusters in a clas...
The first part of this thesis is concerned with Sparse Clustering, which assumes that a potentially ...
The density-based formulation aims at recasting the clustering problem to a mathematically sound fra...
A new clustering approach based on mode identification is developed by applying new optimization tec...
An interesting problem, not often faced under the Bayesian nonparametric framework, is the clusterin...
A mixture of Gaussians fit to a single curved or heavy-tailed cluster will report that the data cont...
A mixture of Gaussians fit to a single curved or heavy-tailed cluster will report that the data cont...
none2In this paper, we propose a method to group a set of probability density functions (pdfs) into ...
We consider the problem of clustering data points in high dimensions, i.e. when the number of data p...
We propose a novel method for multiple clustering, which is useful for analysis of high-dimensional ...
Latent variable models are used extensively in unsupervised learning within the Bayesian paradigm, t...
Variable selection is an important problem for cluster analysis of high-dimensional data. It is also...
We are interested in clustering data whose support is “curved”. Recently we have addressed this prob...
We are interested in clustering data whose support is “curved”. Recently we have addressed this prob...
A BSTRACT. We are interested in clustering data whose support is “curved”. Recently we have ad- dres...
We present a novel algorithm called PG-means which is able to learn the number of clusters in a clas...
The first part of this thesis is concerned with Sparse Clustering, which assumes that a potentially ...
The density-based formulation aims at recasting the clustering problem to a mathematically sound fra...
A new clustering approach based on mode identification is developed by applying new optimization tec...
An interesting problem, not often faced under the Bayesian nonparametric framework, is the clusterin...