Many iterative techniques are sensitive to the initial conditions, thus getting stuck in local optima. This paper explores two simple, computationally fast methods that allow the refinement of the initial points of k-means to cluster a given data set. They are based on alternating k-means and the search of the deepest (most representative) point of each cluster, and on the combination of bootstrapped k-means results and the search of the deepest points of each cluster on the space of cluster centers. In addition, this second alternative can be used to provide a soft clustering of the data assigning a cluster belongingness index to each element. The methods are tested in simulated and real data and prove to be efficient and faster than previ...
Traditional K-means algorithm's clustering effect is affected by the initial cluster center poin...
In this paper, the standard k-means algorithm has been improved in terms of the initial cluster cent...
The k-means clustering problem asks to partition the data into k clusters so as to minimize the sum ...
Abstract: Clustering is a data mining (machine learning), unsupervised learning technique used to pl...
K-means, a simple and effective clustering algorithm, is one of the most widely used algorithms-in c...
Cluster analysis method is one of the most analytical methods of data mining. The method will direct...
215 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2004.The study of the properties o...
Abstract: Initial starting points those generated randomly by K-means often make the clustering resu...
K-means, a simple and effective clustering algorithm, is one of the most widely used algorithms in c...
Abstract: K-Means is the most popular clustering algorithm with the convergence to one of numerous ...
AbstractIn k-means clustering we are given a set of n data points in d-dimensional space Rd and an i...
We [6, 7] have recently investigated several families of clustering algorithms. In this paper, we sh...
We present new algorithms for the k-means clustering problem. They use the kd-tree data structure to...
Clustering is a practical data mining approach of pattern detection. Because of the sensitivity of i...
Competent data mining methods are vital to discover knowledge from databases which are built as a re...
Traditional K-means algorithm's clustering effect is affected by the initial cluster center poin...
In this paper, the standard k-means algorithm has been improved in terms of the initial cluster cent...
The k-means clustering problem asks to partition the data into k clusters so as to minimize the sum ...
Abstract: Clustering is a data mining (machine learning), unsupervised learning technique used to pl...
K-means, a simple and effective clustering algorithm, is one of the most widely used algorithms-in c...
Cluster analysis method is one of the most analytical methods of data mining. The method will direct...
215 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2004.The study of the properties o...
Abstract: Initial starting points those generated randomly by K-means often make the clustering resu...
K-means, a simple and effective clustering algorithm, is one of the most widely used algorithms in c...
Abstract: K-Means is the most popular clustering algorithm with the convergence to one of numerous ...
AbstractIn k-means clustering we are given a set of n data points in d-dimensional space Rd and an i...
We [6, 7] have recently investigated several families of clustering algorithms. In this paper, we sh...
We present new algorithms for the k-means clustering problem. They use the kd-tree data structure to...
Clustering is a practical data mining approach of pattern detection. Because of the sensitivity of i...
Competent data mining methods are vital to discover knowledge from databases which are built as a re...
Traditional K-means algorithm's clustering effect is affected by the initial cluster center poin...
In this paper, the standard k-means algorithm has been improved in terms of the initial cluster cent...
The k-means clustering problem asks to partition the data into k clusters so as to minimize the sum ...