The assessment of stability in cluster analysis is strongly related to the main difficult problem of determining the number of clusters present in the data. The latter is subject of many investigations and papers considering different resampling techniques as practical tools. In this paper, we consider non-parametric resampling from the empirical distribution of a given dataset in order to investigate the stability of results of partitional clustering. In detail, we investigate here only the very popular K-means method. The estimation of the sampling distribution of the adjusted Rand index (ARI) and the averaged Jaccard index seems to be the most general way to do this. In addition, we compare bootstrapping with different subsampling scheme...
Cluster validity investigates whether generated clusters are true clusters or due to chance. This is...
The estimation of the appropriate number of clusters is a known problem in cluster anal-ysis, that a...
The aim of this work on cluster analysis is to provide a methodology to analyse and assess the quali...
The assessment of stability in cluster analysis is strongly related to the main difficult problem of...
Clustering is a challenging problem in unsupervised learning. In lieu of a gold standard, stability ...
An important problem in clustering research is the stability of sample clusters. Cluster diagnostics...
In cluster analysis, selecting the number of clusters is an "ill-posed" problem of crucial importanc...
In simulation studies based on many synthetic and real datasets, we found out that subsampling has a...
A popular method for selecting the number of clusters is based on stability arguments: one chooses t...
Abstract Background Hierarchical clustering is a widely applied tool in the analysis of microarray g...
A popular method for selecting the number of clusters is based on sta-bility arguments: one chooses ...
Segmentation results derived using cluster analysis depend on (1) the structure of the data and (2...
K-means clustering is widely used for exploratory data analysis. While its dependence on initialisat...
Segmentation results derived using cluster analysis depend on (1) the structure of the data and (2) ...
A unified theory is presented to assess the robustness of general clustering methods (GCM), i.e., me...
Cluster validity investigates whether generated clusters are true clusters or due to chance. This is...
The estimation of the appropriate number of clusters is a known problem in cluster anal-ysis, that a...
The aim of this work on cluster analysis is to provide a methodology to analyse and assess the quali...
The assessment of stability in cluster analysis is strongly related to the main difficult problem of...
Clustering is a challenging problem in unsupervised learning. In lieu of a gold standard, stability ...
An important problem in clustering research is the stability of sample clusters. Cluster diagnostics...
In cluster analysis, selecting the number of clusters is an "ill-posed" problem of crucial importanc...
In simulation studies based on many synthetic and real datasets, we found out that subsampling has a...
A popular method for selecting the number of clusters is based on stability arguments: one chooses t...
Abstract Background Hierarchical clustering is a widely applied tool in the analysis of microarray g...
A popular method for selecting the number of clusters is based on sta-bility arguments: one chooses ...
Segmentation results derived using cluster analysis depend on (1) the structure of the data and (2...
K-means clustering is widely used for exploratory data analysis. While its dependence on initialisat...
Segmentation results derived using cluster analysis depend on (1) the structure of the data and (2) ...
A unified theory is presented to assess the robustness of general clustering methods (GCM), i.e., me...
Cluster validity investigates whether generated clusters are true clusters or due to chance. This is...
The estimation of the appropriate number of clusters is a known problem in cluster anal-ysis, that a...
The aim of this work on cluster analysis is to provide a methodology to analyse and assess the quali...