This paper proposes a k-means type clustering algorithm that can automatically calculate variable weights. A new step is introduced to the k-means clustering process to iteratively update variable weights based on the current partition of data and a formula for weight calculation is proposed. The convergency theorem of the new clustering process is given. The variable weights produced by the algorithm measure the importance of variables in clustering and can be used in variable selection in data mining applications where large and complex real data are often involved. Experimental results on both synthetic and real data have shown that the new algorithm outperformed the standard k-means type algorithms in recovering clusters in data
This paper proposes a weight-based self-constructing clustering method for time series data. Self-co...
K-means is one of the most popular and widespread partitioning clustering algorithms due to its supe...
Clustering is a robust machine learning task that involves dividing data points into a set of groups...
Abstract—This paper proposes a k-means type clustering algorithm that can automatically calculate va...
One of the most important problems in cluster analysis is the selection of variables that truly defi...
Cluster analysis is a statistical analysis technique that divides the research objects into relative...
Semi-supervised learning, which uses a small amount of labeled data in conjunction with a large amou...
This paper is concerned with the co-clustering of distribution-valued data, that is, the simultaneou...
Abstract:- Clustering constitutes an important task inside the fields of Pattern Recognition and Dat...
Advances in recent techniques for scientific data collection in the era of big data allow for the sy...
Data mining is the process of finding structure of data from large data sets. With this process, the...
Abstract: The K-means algorithm is a popular data-clustering algorithm. However, one of its drawback...
Cluster analysis method is one of the most analytical methods of data mining. The method will direct...
This paper describes three different fundamental mathematical programming approaches that are releva...
Abstract-Data mining is the process of using technology to identi-fy patterns and prospects from lar...
This paper proposes a weight-based self-constructing clustering method for time series data. Self-co...
K-means is one of the most popular and widespread partitioning clustering algorithms due to its supe...
Clustering is a robust machine learning task that involves dividing data points into a set of groups...
Abstract—This paper proposes a k-means type clustering algorithm that can automatically calculate va...
One of the most important problems in cluster analysis is the selection of variables that truly defi...
Cluster analysis is a statistical analysis technique that divides the research objects into relative...
Semi-supervised learning, which uses a small amount of labeled data in conjunction with a large amou...
This paper is concerned with the co-clustering of distribution-valued data, that is, the simultaneou...
Abstract:- Clustering constitutes an important task inside the fields of Pattern Recognition and Dat...
Advances in recent techniques for scientific data collection in the era of big data allow for the sy...
Data mining is the process of finding structure of data from large data sets. With this process, the...
Abstract: The K-means algorithm is a popular data-clustering algorithm. However, one of its drawback...
Cluster analysis method is one of the most analytical methods of data mining. The method will direct...
This paper describes three different fundamental mathematical programming approaches that are releva...
Abstract-Data mining is the process of using technology to identi-fy patterns and prospects from lar...
This paper proposes a weight-based self-constructing clustering method for time series data. Self-co...
K-means is one of the most popular and widespread partitioning clustering algorithms due to its supe...
Clustering is a robust machine learning task that involves dividing data points into a set of groups...