While data clustering has a long history and a large amount of research has been devoted to the development of numerous clustering techniques, significant challenges still remain. One of the most important of them is associated with high data dimensionality. A particular class of clustering algorithms has been very successful in dealing with such datasets, utilising information driven by the principal component analysis. In this work, we try to deepen our understanding on what can be achieved by this kind of approaches. We attempt to theoretically discover the relationship between true clusters in the data and the distribution of their projection onto the principal components. Based on such findings, we propose appropriate criteria for the ...
Principal component analysis (PCA) guided clustering approach is widely used in high dimensional dat...
ia that provide significant distinctions between clustering methods and can help selecting appropria...
Experiments are carried out on datasets with different dimensions selected from UCI datasets by usin...
While data clustering has a long history and a large amount of research has been devoted to the deve...
k-means algorithm is a popular data clustering algorithm. k-means clustering aims to partition n obs...
Projection methods for dimension reduction have enabled the discovery of otherwise unattainable stru...
Clustering is one of the most widely used statistical tools for data analysis. Among all existing cl...
This paper studies cluster ensembles for high dimensional data clustering. We examine three differen...
The notion of cluster ability is often used to determine how strong the cluster structure within a s...
A new method for constructing interpretable principal components is proposed. The method first clust...
Clustering high dimensional data is an emerging research field. Subspace clustering or projected clu...
A new method for constructing sparse principal components is proposed. The method first clusters the...
Several techniques are used for clustering of high-dimensional data. Traditionally, clustering appro...
Clustering is one of the most widely used statistical tools for data analysis. Among all existing cl...
K-means clustering is being widely studied problem in a variety of application domains. The computat...
Principal component analysis (PCA) guided clustering approach is widely used in high dimensional dat...
ia that provide significant distinctions between clustering methods and can help selecting appropria...
Experiments are carried out on datasets with different dimensions selected from UCI datasets by usin...
While data clustering has a long history and a large amount of research has been devoted to the deve...
k-means algorithm is a popular data clustering algorithm. k-means clustering aims to partition n obs...
Projection methods for dimension reduction have enabled the discovery of otherwise unattainable stru...
Clustering is one of the most widely used statistical tools for data analysis. Among all existing cl...
This paper studies cluster ensembles for high dimensional data clustering. We examine three differen...
The notion of cluster ability is often used to determine how strong the cluster structure within a s...
A new method for constructing interpretable principal components is proposed. The method first clust...
Clustering high dimensional data is an emerging research field. Subspace clustering or projected clu...
A new method for constructing sparse principal components is proposed. The method first clusters the...
Several techniques are used for clustering of high-dimensional data. Traditionally, clustering appro...
Clustering is one of the most widely used statistical tools for data analysis. Among all existing cl...
K-means clustering is being widely studied problem in a variety of application domains. The computat...
Principal component analysis (PCA) guided clustering approach is widely used in high dimensional dat...
ia that provide significant distinctions between clustering methods and can help selecting appropria...
Experiments are carried out on datasets with different dimensions selected from UCI datasets by usin...