Motivated by the poor performance (linear complexity) of the EM algorithm in clustering large data sets, and inspired by the successful accelerated versions of related algorithms like k-means, we derive an accelerated variant of the EM algorithm for Gaussian mixtures that: (1) offers speedups that are at least linear in the number of data points, (2) ensures convergence by strictly increasing a lower bound on the data log-likelihood in each learning step, and (3) allows ample freedom in the design of other accelerated variants. We also derive a similar accelerated algorithm for greedy mixture learning, where very satisfactory results are obtained. The core idea is to define a lower bound on the data log-likelihood based on a grouping of dat...
We present two scalable model-based clustering systems based on a Gaussian mixture model with indepe...
This work proposes an exponential computation with low-computational complexity and applies this tec...
The Expectation–Maximization (EM) algorithm is a popular tool in a wide variety of statistical setti...
peer reviewedMotivated by the poor performance (linear complexity) of the EM algorithm in clustering...
Due to the existence of a large number of sample data which obey the Gaussian distribution,GMM (Gaus...
Mixture models become increasingly popular due to their modeling flexibility and are applied to the ...
peer reviewedLearning a Gaussian mixture with a local algorithm like EM can be difficult because (i)...
Abstract. The EM algorithm for Gaussian mixture models often gets caught in local maxima of the like...
Within the learning framework of maximum weighted likelihood (MWL) proposed by Cheung, 2004 and 2005...
Cluster analysis faces two problems in high dimensions: first, the “curse of di-mensionality ” that ...
Address email Clustering is often formulated as the maximum likelihood estimation of a mixture model...
Finite mixture models are being increasingly used to model the distributions of a wide variety of ra...
This paper represents a preliminary (pre-reviewing) version of a sublinear variational algorithm for...
In Chapter 1 we give a general introduction and motivate the need for clustering and dimension reduc...
Abstract: In this paper deals with clustering models based on the Gaussian Mixtures. Parameters are ...
We present two scalable model-based clustering systems based on a Gaussian mixture model with indepe...
This work proposes an exponential computation with low-computational complexity and applies this tec...
The Expectation–Maximization (EM) algorithm is a popular tool in a wide variety of statistical setti...
peer reviewedMotivated by the poor performance (linear complexity) of the EM algorithm in clustering...
Due to the existence of a large number of sample data which obey the Gaussian distribution,GMM (Gaus...
Mixture models become increasingly popular due to their modeling flexibility and are applied to the ...
peer reviewedLearning a Gaussian mixture with a local algorithm like EM can be difficult because (i)...
Abstract. The EM algorithm for Gaussian mixture models often gets caught in local maxima of the like...
Within the learning framework of maximum weighted likelihood (MWL) proposed by Cheung, 2004 and 2005...
Cluster analysis faces two problems in high dimensions: first, the “curse of di-mensionality ” that ...
Address email Clustering is often formulated as the maximum likelihood estimation of a mixture model...
Finite mixture models are being increasingly used to model the distributions of a wide variety of ra...
This paper represents a preliminary (pre-reviewing) version of a sublinear variational algorithm for...
In Chapter 1 we give a general introduction and motivate the need for clustering and dimension reduc...
Abstract: In this paper deals with clustering models based on the Gaussian Mixtures. Parameters are ...
We present two scalable model-based clustering systems based on a Gaussian mixture model with indepe...
This work proposes an exponential computation with low-computational complexity and applies this tec...
The Expectation–Maximization (EM) algorithm is a popular tool in a wide variety of statistical setti...