It is well known that some local search heuristics for K-clustering problems, such as k-means heuristic for minimum sum-of-squares clustering occasionally stop at a solution with a smaller number of clusters than the desired number K. Such solutions are called degenerate. In this paper, we reveal that the degeneracy also exists in K-harmonic means (KHM) method, proposed as an alternative to K-means heuristic, but which is less sensitive to the initial solution. In addition, we discover two types of degenerate solutions and provide examples for both. Based on these findings, we give a simple method to remove degeneracy during the execution of the KHM heuristic; it can be used as a part of any other heuristic for KHM clustering proble...
Abstract: Clustering is a data mining (machine learning), unsupervised learning technique used to pl...
Clustering, particularly hierarchical clustering, is an important method for understanding and analy...
Knowledge discovery from data can be broadly categorized into two types: supervised and unsupervised...
This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University.Al...
The K-harmonic means clustering algorithm (KHM) is a new clustering method used to group data such t...
Clustering is a process of extracting reliable, unique, effective and comprehensible patterns from d...
A new local search heuristic, called J-Means, is proposed for solving the minimum sum-of-squares clu...
We investigate here the behavior of the standard k-means clustering algorithm and several alternativ...
Among the data clustering algorithms, the k-means (KM) algorithm is one of the most popular clusteri...
Clustering is an important task in data mining. It can be formulated as a global optimization proble...
Abstract—Due to the explosion in the number of autonomous data sources, there is a growing need for ...
In this article the new hybrid data clustering approach, Gravitational Genetic KHM, based on Genetic...
The k-means clustering problem asks to partition the data into k clusters so as to minimize the sum ...
We review the performance function associated with the familiar K-Means algorithm and that of the re...
Clustering is a popular data analysis and data mining problem. Symmetry can be considered as a pre-a...
Abstract: Clustering is a data mining (machine learning), unsupervised learning technique used to pl...
Clustering, particularly hierarchical clustering, is an important method for understanding and analy...
Knowledge discovery from data can be broadly categorized into two types: supervised and unsupervised...
This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University.Al...
The K-harmonic means clustering algorithm (KHM) is a new clustering method used to group data such t...
Clustering is a process of extracting reliable, unique, effective and comprehensible patterns from d...
A new local search heuristic, called J-Means, is proposed for solving the minimum sum-of-squares clu...
We investigate here the behavior of the standard k-means clustering algorithm and several alternativ...
Among the data clustering algorithms, the k-means (KM) algorithm is one of the most popular clusteri...
Clustering is an important task in data mining. It can be formulated as a global optimization proble...
Abstract—Due to the explosion in the number of autonomous data sources, there is a growing need for ...
In this article the new hybrid data clustering approach, Gravitational Genetic KHM, based on Genetic...
The k-means clustering problem asks to partition the data into k clusters so as to minimize the sum ...
We review the performance function associated with the familiar K-Means algorithm and that of the re...
Clustering is a popular data analysis and data mining problem. Symmetry can be considered as a pre-a...
Abstract: Clustering is a data mining (machine learning), unsupervised learning technique used to pl...
Clustering, particularly hierarchical clustering, is an important method for understanding and analy...
Knowledge discovery from data can be broadly categorized into two types: supervised and unsupervised...