K-means is one of the most popular and widespread partitioning clustering algorithms due to its superior scalability and efficiency. Typically, the K-means algorithm treats all features fairly and sets weights of all features equally when evaluating dissimilarity. However, a meaningful clustering phenomenon often occurs in a subspace defined by a specific subset of all features. To address this issue, this paper proposes a novel feature weight self-adjustment (FWSA) mechanism embedded into K-means in order to improve the clustering quality of K-means. In the FWSA mechanism, finding feature weights is modeled as an optimization problem to simultaneously minimize the separations within clusters and maximize the separations between clusters. W...
Processing applications with a large number of dimensions has been a challenge to the KDD community....
Advances in recent techniques for scientific data collection in the era of big data allow for the sy...
The feature selection represents a key step in mining high-dimensional data: the significance of fea...
Cluster analysis is a statistical analysis technique that divides the research objects into relative...
In a real-world data set there is always the possibility, rather high in our opinion, that different...
The aim of feature reduction is reduction of the size of data file, elimination of irrelevant featur...
We present a novel feature selection algorithm for the k-means clustering problem. Our algorithm is ...
International audienceSimultaneous selection of the number of clusters and of a relevant subset of f...
Abstract—This paper proposes a k-means type clustering algorithm that can automatically calculate va...
Abstract—This paper introduces an optimized version of the standard K-Means algorithm. The optimizat...
This paper proposes a k-means type clustering algorithm that can automatically calculate variable we...
K-Means is one of the most popular clustering algorithms, and it is easy to implement It seeks to m...
Mean shift is a simple interactive procedure that gradually shifts data points towards the mode whic...
The K-means algorithm is one of the ten classic algorithms in the area of data mining and has been s...
An appropriate distance is an essential ingredient in various real-world learning tasks. Distance me...
Processing applications with a large number of dimensions has been a challenge to the KDD community....
Advances in recent techniques for scientific data collection in the era of big data allow for the sy...
The feature selection represents a key step in mining high-dimensional data: the significance of fea...
Cluster analysis is a statistical analysis technique that divides the research objects into relative...
In a real-world data set there is always the possibility, rather high in our opinion, that different...
The aim of feature reduction is reduction of the size of data file, elimination of irrelevant featur...
We present a novel feature selection algorithm for the k-means clustering problem. Our algorithm is ...
International audienceSimultaneous selection of the number of clusters and of a relevant subset of f...
Abstract—This paper proposes a k-means type clustering algorithm that can automatically calculate va...
Abstract—This paper introduces an optimized version of the standard K-Means algorithm. The optimizat...
This paper proposes a k-means type clustering algorithm that can automatically calculate variable we...
K-Means is one of the most popular clustering algorithms, and it is easy to implement It seeks to m...
Mean shift is a simple interactive procedure that gradually shifts data points towards the mode whic...
The K-means algorithm is one of the ten classic algorithms in the area of data mining and has been s...
An appropriate distance is an essential ingredient in various real-world learning tasks. Distance me...
Processing applications with a large number of dimensions has been a challenge to the KDD community....
Advances in recent techniques for scientific data collection in the era of big data allow for the sy...
The feature selection represents a key step in mining high-dimensional data: the significance of fea...