The K-means clustering algorithm is an old algorithm that has been intensely researched owing to its simplicity of implementation. However, there have also been criticisms on its performance, in particular, for demanding the value of K a priori. It is evident from previous researches that providing the number of clusters a priori does not in any way assist in the production of good quality clusters. The objective of this paper is to investigate the usefulness of the K-means clustering in the clustering of high and multi-dimensional data by applying it to biological sequence data which is known for high and multi-dimension. The squared-Euclidean distance and the cosine measure are used as the similarity measures. The silhouette validity inde...
A clustering algorithm that exploits special characteristics of a data set may lead to superior resu...
Data clustering techniques are valuable tools for researchers working with large databases of multiv...
In cancer research, class discovery is the first process for investigating a new dataset for which h...
Advances in recent techniques for scientific data collection in the era of big data allow for the sy...
<div><p>Traditional k-means and most k-means variants are still computationally expensive for large ...
Traditional k-means and most k-means variants are still computationally expensive for large datasets...
K-means clustering is being widely studied problem in a variety of application domains. The computat...
Abstract: K-means algorithm is a popular, unsupervised and iterative clustering algorithmwell known ...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
More and more data are produced every day. Some clustering techniques have been developed to automat...
The K-means clustering algorithm works on a data set with n data points in d dimensional space R^d. ...
The data mining is the knowledge extraction or finding the hidden patterns from large data these dat...
Working with huge amount of data and learning from it by extracting useful information is one of the...
Data mining technique used in the field of clustering is a subject of active research and assists in...
The unsupervised clustering of biological sequences is an important task in the bioinformatics space...
A clustering algorithm that exploits special characteristics of a data set may lead to superior resu...
Data clustering techniques are valuable tools for researchers working with large databases of multiv...
In cancer research, class discovery is the first process for investigating a new dataset for which h...
Advances in recent techniques for scientific data collection in the era of big data allow for the sy...
<div><p>Traditional k-means and most k-means variants are still computationally expensive for large ...
Traditional k-means and most k-means variants are still computationally expensive for large datasets...
K-means clustering is being widely studied problem in a variety of application domains. The computat...
Abstract: K-means algorithm is a popular, unsupervised and iterative clustering algorithmwell known ...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
More and more data are produced every day. Some clustering techniques have been developed to automat...
The K-means clustering algorithm works on a data set with n data points in d dimensional space R^d. ...
The data mining is the knowledge extraction or finding the hidden patterns from large data these dat...
Working with huge amount of data and learning from it by extracting useful information is one of the...
Data mining technique used in the field of clustering is a subject of active research and assists in...
The unsupervised clustering of biological sequences is an important task in the bioinformatics space...
A clustering algorithm that exploits special characteristics of a data set may lead to superior resu...
Data clustering techniques are valuable tools for researchers working with large databases of multiv...
In cancer research, class discovery is the first process for investigating a new dataset for which h...