Stability has been considered an important property for evaluating clustering solutions. Nevertheless, there are no conclusive studies on the relationship between this property and the capacity to recover clusters inherent to data (“ground truth”). This study focuses on this relationship, resorting to experiments on synthetic data generated under diverse scenarios (controlling relevant factors) and experiments on real data sets. Stability is evaluated using a weighted cross-validation procedure. Indices of agreement (corrected for agreement by chance) are used to assess both stability and external validity. The results reveal a perspective not previously mentioned in the literature. Despite the clear relationship between stability ...
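As an illustration of an index of agreement corrected for chance, the sketch below computes the adjusted Rand index between two partitions of the same observations. It is a minimal example under stated assumptions: the abstract does not name the specific indices used, and the function name `adjusted_rand_index` and the toy labelings are hypothetical. The same function can compare two clusterings of the same points (stability) or a clustering against known classes (external validity); the weighted cross-validation procedure itself is not reproduced here.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_a, labels_b):
    """Rand index corrected for chance agreement (illustrative sketch)."""
    if len(labels_a) != len(labels_b):
        raise ValueError("labelings must have the same length")
    n = len(labels_a)
    if n < 2:
        raise ValueError("at least two observations are required")

    # Count pairs of observations placed together, using the contingency
    # table of the two partitions and its row and column margins.
    sum_cells = sum(comb(c, 2) for c in Counter(zip(labels_a, labels_b)).values())
    sum_rows = sum(comb(c, 2) for c in Counter(labels_a).values())
    sum_cols = sum(comb(c, 2) for c in Counter(labels_b).values())

    expected = sum_rows * sum_cols / comb(n, 2)  # agreement expected by chance
    max_index = (sum_rows + sum_cols) / 2        # largest attainable agreement
    if max_index == expected:
        return 1.0  # degenerate case: both partitions are trivial
    return (sum_cells - expected) / (max_index - expected)


# Hypothetical toy labelings: the same grouping under different label names.
same_grouping = adjusted_rand_index([0, 0, 1, 1], [1, 1, 0, 0])
# Hypothetical toy labelings: two groupings that never agree on a pair.
crossed_grouping = adjusted_rand_index([0, 0, 1, 1], [0, 1, 0, 1])
```

A value of 1 indicates identical partitions regardless of label names, values near 0 indicate agreement no better than chance, and negative values indicate less agreement than chance would produce.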
A key issue in cluster analysis is the choice of an appropriate clustering method and the determinat...
Cluster validity investigates whether generated clusters are true clusters or due to chance. This is...
In cluster analysis, selecting the number of clusters is an "ill-posed" problem of crucial importanc...
Over the past few years, the notion of stability in data clustering has received growing attention a...
A popular method for selecting the number of clusters is based on stability arguments: one chooses t...
In this work, a novel technique to address the problem of cluster validation based on cluster stabil...
Stability is a common tool to verify the validity of sample-based algorithms. In clustering it is wi...
Many different clustering algorithms have been developed...
Typically, clustering algorithms provide clustering solutions with a prespecified number of clusters. T...
In the present paper we compare clustering solutions using indices of paired agreement. We propose a...
An important problem in clustering research is the stability of sample clusters. Cluster diagnostics...
A unified theory is presented to assess the robustness of general clustering methods (GCM), i.e., me...