We investigate the role of the initialization for the stability of the қ-means clustering algorithm. As opposed to other papers, we consider the actual қ-means algorithm (also known as Lloyd algorithm). In particular we leverage on the property that this algorithm can get stuck in local optima of the қ-means objective function. We are interested in the actual clustering, not only in the costs of the solution. We analyze when different initializations lead to the same local optimum, and when they lead to different local optima. This enables us to prove that it is reasonable to select the number of clusters based on stability scores
215 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2004.The study of the properties o...
Optimal clustering is a notoriously hard task. Recently, several papers have suggested a new approac...
The k-means algorithm is a popular clustering method used in many different fields of computer scien...
We investigate the role of the initialization for the stability of the қ-means clustering algorithm....
K-Means is one of the most popular clustering algorithms, and it is easy to implement It seeks to m...
k-means is a simple and flexible clustering algorithm that has remained in common use for 50+ years....
In the context of unsupervised learning, Lloyd's algorithm is one of the most widely used clustering...
We phrase K-means clustering as an empirical risk minimization procedure over a class HK and explici...
A paradox for “k-means clustering” k-means objective φ of C = {ci, i ∈ [k]} on a dataset X: φX(C) = ...
Stability is a common tool to verify the validity of sample based algorithms. In clustering it is wi...
Stability is a common tool to verify the validity of sample based algorithms. In clustering it is wi...
A popular method for selecting the number of clusters is based on stability arguments: one chooses t...
It is well known that the clusters produced by a clustering algorithm depend on the chosen initial c...
With the hypothesis of Gaussian distribution of patterns, K-means and its extensions are good for cl...
Abstract — The famous K-means clustering algorithm is sensitive to the selection of the initial cent...
215 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2004.The study of the properties o...
Optimal clustering is a notoriously hard task. Recently, several papers have suggested a new approac...
The k-means algorithm is a popular clustering method used in many different fields of computer scien...
We investigate the role of the initialization for the stability of the қ-means clustering algorithm....
K-Means is one of the most popular clustering algorithms, and it is easy to implement It seeks to m...
k-means is a simple and flexible clustering algorithm that has remained in common use for 50+ years....
In the context of unsupervised learning, Lloyd's algorithm is one of the most widely used clustering...
We phrase K-means clustering as an empirical risk minimization procedure over a class HK and explici...
A paradox for “k-means clustering” k-means objective φ of C = {ci, i ∈ [k]} on a dataset X: φX(C) = ...
Stability is a common tool to verify the validity of sample based algorithms. In clustering it is wi...
Stability is a common tool to verify the validity of sample based algorithms. In clustering it is wi...
A popular method for selecting the number of clusters is based on stability arguments: one chooses t...
It is well known that the clusters produced by a clustering algorithm depend on the chosen initial c...
With the hypothesis of Gaussian distribution of patterns, K-means and its extensions are good for cl...
Abstract — The famous K-means clustering algorithm is sensitive to the selection of the initial cent...
215 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2004.The study of the properties o...
Optimal clustering is a notoriously hard task. Recently, several papers have suggested a new approac...
The k-means algorithm is a popular clustering method used in many different fields of computer scien...