Initial seed artefacts play a vital role in proper categorization of the given data set in partitioning based clustering algorithms. Hence, it is important to identify them. We propose a density with distance based method which ensures identification of seed artefacts from different clusters that leads to more accurate clustering results. Our algorithm improves on the search for initial seed artefacts iteratively until the minimum value of the sum of within sum errors, normalized by their data sizes, is ensured. This is because the initial artefacts are selected from different clusters. Here the choice of seed artefacts guarantees a global optimum clustering solution. We have compared our results with random, Wu, Cao and Khan’s methods of i...
We present a nonparametric method for selecting informative features in high-dimensional clustering ...
Processing applications with a large number of dimensions has been a challenge to the KDD community....
Abstract: Clustering is a well known data mining technique which is used to group together data item...
Open issues with respect to K means algorithm are identifying the number of clusters, initial seed c...
AbstractIn this paper we present two clustering techniques called ModEx and Seed-Detective. ModEx is...
The partitional clustering technique, k-means, is one of the most computationally efficient clusteri...
Advances in recent techniques for scientific data collection in the era of big data allow for the sy...
Clustering is a practical data mining approach of pattern detection. Because of the sensitivity of i...
In this paper, we propose a density-based clusters' representatives selection algorithm that identif...
Partitioning clustering is generally performed using K-modes cluster algorithms, which work well for...
The k-means algorithm is one of the most popular clustering techniques because of its speed and simp...
Partition-based clustering technique is one of several clustering techniques that attempt to directl...
Explores the applicability of simulated annealing, a probabilistic search method, for finding optima...
Traditional K-means algorithm's clustering effect is affected by the initial cluster center poin...
Data mining is a technique which extracts the information from the large amount of data. To group th...
We present a nonparametric method for selecting informative features in high-dimensional clustering ...
Processing applications with a large number of dimensions has been a challenge to the KDD community....
Abstract: Clustering is a well known data mining technique which is used to group together data item...
Open issues with respect to K means algorithm are identifying the number of clusters, initial seed c...
AbstractIn this paper we present two clustering techniques called ModEx and Seed-Detective. ModEx is...
The partitional clustering technique, k-means, is one of the most computationally efficient clusteri...
Advances in recent techniques for scientific data collection in the era of big data allow for the sy...
Clustering is a practical data mining approach of pattern detection. Because of the sensitivity of i...
In this paper, we propose a density-based clusters' representatives selection algorithm that identif...
Partitioning clustering is generally performed using K-modes cluster algorithms, which work well for...
The k-means algorithm is one of the most popular clustering techniques because of its speed and simp...
Partition-based clustering technique is one of several clustering techniques that attempt to directl...
Explores the applicability of simulated annealing, a probabilistic search method, for finding optima...
Traditional K-means algorithm's clustering effect is affected by the initial cluster center poin...
Data mining is a technique which extracts the information from the large amount of data. To group th...
We present a nonparametric method for selecting informative features in high-dimensional clustering ...
Processing applications with a large number of dimensions has been a challenge to the KDD community....
Abstract: Clustering is a well known data mining technique which is used to group together data item...