Abstract—Due to the explosion in the number of autonomous data sources, there is a growing need for effective approaches for distributed knowledge discovery and data mining. The distributed clustering algorithm is used to cluster the distributed datasets without necessarily downloading all the data to a single site. K-Means is used as a popular clustering method due to its simplicity and high speed in clustering large datasets. The dependency of the K-Means performance on the initialization of centroids is a major problem. Similarly, distributed clustering algorithm based on K-Means is also sensitive to centroid initialization. It is demonstrated that K-Harmonic Means is essentially insensitive to centroid initialization. In this paper, a n...
Cluster analysis or clustering is the task of grouping a set of objects in such a way that objects i...
Abstract — Clustering is a collection of objects which are similar between them and dissimilar to th...
At present, the explosive growth of data and the mass storage state have brought many problems such ...
<p>Clustering has been recognized as the unsupervised classification of data items into groups. Due ...
Clustering is a process of extracting reliable, unique, effective and comprehensible patterns from d...
The K-harmonic means clustering algorithm (KHM) is a new clustering method used to group data such t...
Abstract-Clustering is one of the important methods in data mining to weight the distance between th...
We investigate here the behavior of the standard k-means clustering algorithm and several alternativ...
The K-Means algorithm for cluster analysis is one of the most influential and popular data mining me...
Among the data clustering algorithms, the k-means (KM) algorithm is one of the most popular clusteri...
K-means is an iterative algorithm used with clustering task. It has more characteristics such as sim...
Dealing with big amounts of data is one of the challenges for clustering, which causes the need for ...
This paper provides new algorithms for distributed clustering for two popular center-based objec-tiv...
It is well known that some local search heuristics for K-clustering problems, such as k-means heuri...
One of the challenges for clustering resides in dealing with data distributed in separated repositor...
Cluster analysis or clustering is the task of grouping a set of objects in such a way that objects i...
Abstract — Clustering is a collection of objects which are similar between them and dissimilar to th...
At present, the explosive growth of data and the mass storage state have brought many problems such ...
<p>Clustering has been recognized as the unsupervised classification of data items into groups. Due ...
Clustering is a process of extracting reliable, unique, effective and comprehensible patterns from d...
The K-harmonic means clustering algorithm (KHM) is a new clustering method used to group data such t...
Abstract-Clustering is one of the important methods in data mining to weight the distance between th...
We investigate here the behavior of the standard k-means clustering algorithm and several alternativ...
The K-Means algorithm for cluster analysis is one of the most influential and popular data mining me...
Among the data clustering algorithms, the k-means (KM) algorithm is one of the most popular clusteri...
K-means is an iterative algorithm used with clustering task. It has more characteristics such as sim...
Dealing with big amounts of data is one of the challenges for clustering, which causes the need for ...
This paper provides new algorithms for distributed clustering for two popular center-based objec-tiv...
It is well known that some local search heuristics for K-clustering problems, such as k-means heuri...
One of the challenges for clustering resides in dealing with data distributed in separated repositor...
Cluster analysis or clustering is the task of grouping a set of objects in such a way that objects i...
Abstract — Clustering is a collection of objects which are similar between them and dissimilar to th...
At present, the explosive growth of data and the mass storage state have brought many problems such ...