Stability is a common tool to verify the validity of sample based algorithms. In clustering it is widely used to tune the parameters of the algorithm, such as the number k of clusters. In spite of the popularity of stability in practical applications, there has been very little theoretical analysis of this notion. In this paper we provide a formal definition of stability and analyze some of its basic properties. Quite surprisingly, the conclusion of our analysis is that for large sample size, stability is fully determined by the behavior of the objective function which the clustering algorithm is aiming to minimize. If the objective function has a unique global minimizer, the algorithm is stable, otherwise it is unstable. In particular we c...
K-means clustering is widely used for exploratory data analysis. While its dependence on initialisat...
Stability has been considered an important property for evaluating clustering solutions. Nevertheles...
A novel center-based clustering algorithm is proposed in this paper. We first for-mulate clustering ...
Stability is a common tool to verify the validity of sample based algorithms. In clustering it is wi...
Stability is a common tool to verify the validity of sample based algorithms. In clustering it is wi...
A popular method for selecting the number of clusters is based on stability arguments: one chooses t...
In this paper, we investigate stability-based methods for cluster model selection, in particular to ...
A popular method for selecting the number of clusters is based on sta-bility arguments: one chooses ...
Over the past few years, the notion of stability in data clustering has received growing attention a...
We phrase K-means clustering as an empirical risk minimization procedure over a class HK and explici...
Optimal clustering is a notoriously hard task. Recently, several papers have suggested a new approac...
High accuracy of the results is very important task in any grouping problem (clustering). It determi...
Among the areas of data and text mining which are employed today in OR, science, economy and technol...
A unified theory is presented to assess the robustness of general clustering methods (GCM), i.e., me...
We improve instability-based methods for the selection of the number of clusters k in cluster analys...
K-means clustering is widely used for exploratory data analysis. While its dependence on initialisat...
Stability has been considered an important property for evaluating clustering solutions. Nevertheles...
A novel center-based clustering algorithm is proposed in this paper. We first for-mulate clustering ...
Stability is a common tool to verify the validity of sample based algorithms. In clustering it is wi...
Stability is a common tool to verify the validity of sample based algorithms. In clustering it is wi...
A popular method for selecting the number of clusters is based on stability arguments: one chooses t...
In this paper, we investigate stability-based methods for cluster model selection, in particular to ...
A popular method for selecting the number of clusters is based on sta-bility arguments: one chooses ...
Over the past few years, the notion of stability in data clustering has received growing attention a...
We phrase K-means clustering as an empirical risk minimization procedure over a class HK and explici...
Optimal clustering is a notoriously hard task. Recently, several papers have suggested a new approac...
High accuracy of the results is very important task in any grouping problem (clustering). It determi...
Among the areas of data and text mining which are employed today in OR, science, economy and technol...
A unified theory is presented to assess the robustness of general clustering methods (GCM), i.e., me...
We improve instability-based methods for the selection of the number of clusters k in cluster analys...
K-means clustering is widely used for exploratory data analysis. While its dependence on initialisat...
Stability has been considered an important property for evaluating clustering solutions. Nevertheles...
A novel center-based clustering algorithm is proposed in this paper. We first for-mulate clustering ...