Abstract: A fundamental and difficult problem in cluster analysis is the determination of the “true ” number of clusters in a dataset. The common trail-and-error method generally depends on certain clustering algorithms and is inefficient when processing large datasets. In this paper, a hierarchical method is proposed to get rid of repeatedly clustering on large datasets. The method firstly obtains the CF (clustering feature) via scanning the dataset and agglomerative generates the hierarchical partitions of dataset, then a curve of the clustering quality w.r.t the varying partitions is incrementally constructed. The partitions corresponding to the extremum of the curve is used to estimate the number of clusters finally. A new validity ind...
A large number of classification and clustering methods for defining and calculating optimal or well...
Determining the number of clusters in a dataset has been one of the most challenging problems in clu...
Clustering analysis seeks to partition a given dataset into groups or clusters so that the data obje...
Determining an optimal number of clusters and producing reliable results are two challenging and cri...
The objective of data mining is to take out information from large amounts of data and convert it in...
Clustering is one of the most popular artificial intelligence techniques which aims at identifying g...
Abstract — Clustering techniques have a wide use and importance nowadays. This importance tends to i...
Clustering analysis is one of the most commonly used techniques for uncovering patterns in data mini...
We propose a clustering method which produces valid results while automatically determining an optim...
Techniques based on agglomerative hierarchical clustering constitute one of the most frequent approa...
Clustering aims to differentiate objects from different groups (clusters) by similarities or distanc...
Determining intrinsic number of clusters in a multidimensional dataset is a commonly encountered pro...
Hierarchical clustering constructs a hierarchy of clusters by either repeatedly merging two smaller ...
In the cluster analysis literature, there are several partitioning (non-hierarchical) methods for cl...
DNA microarray and gene expression problems often require a researcher to perform clustering on thei...
A large number of classification and clustering methods for defining and calculating optimal or well...
Determining the number of clusters in a dataset has been one of the most challenging problems in clu...
Clustering analysis seeks to partition a given dataset into groups or clusters so that the data obje...
Determining an optimal number of clusters and producing reliable results are two challenging and cri...
The objective of data mining is to take out information from large amounts of data and convert it in...
Clustering is one of the most popular artificial intelligence techniques which aims at identifying g...
Abstract — Clustering techniques have a wide use and importance nowadays. This importance tends to i...
Clustering analysis is one of the most commonly used techniques for uncovering patterns in data mini...
We propose a clustering method which produces valid results while automatically determining an optim...
Techniques based on agglomerative hierarchical clustering constitute one of the most frequent approa...
Clustering aims to differentiate objects from different groups (clusters) by similarities or distanc...
Determining intrinsic number of clusters in a multidimensional dataset is a commonly encountered pro...
Hierarchical clustering constructs a hierarchy of clusters by either repeatedly merging two smaller ...
In the cluster analysis literature, there are several partitioning (non-hierarchical) methods for cl...
DNA microarray and gene expression problems often require a researcher to perform clustering on thei...
A large number of classification and clustering methods for defining and calculating optimal or well...
Determining the number of clusters in a dataset has been one of the most challenging problems in clu...
Clustering analysis seeks to partition a given dataset into groups or clusters so that the data obje...