We have developed and evaluated two parallelization schemes for a tree-based k-means clustering method on shared memory machines. One scheme is to partition the pattern space across processors. We have determined that spatial decomposition of patterns outperforms random decomposition even though random decomposition has almost no load imbalance problem. The other scheme is the parallel traverse of the search tree. This approach solves the load imbalance problem and performs slightly better than the spatial decomposition, but the efficiency is reduced due to thread synchronizations. In both cases, parallel treebased k-means clustering is significantly faster than the direct parallel k-means. © Springer-Verlag Berlin Heidelberg 2001
Abstract. To cluster increasingly massive data sets that are common today in data and text mining, w...
A cluster is a collection of data objects that are similar to each other and dissimilar to the data ...
AbstractThe process of partitioning a large set of patterns into disjoint and homogeneous clusters i...
In this paper, we present a novel algorithm for performing k-means clustering. It organizes all the ...
Ankara : The Department of Computer Engineering and the Institute of Engineering and Science of Bilk...
One among the most influential and popular data mining methods is the k-Means algorithm for cluster ...
Clustering can be defined as the process of partitioning a set of patterns into disjoint and homoge...
Handling and processing of larger volume of data requires efficient data mining algorithms. k-means ...
At present, the explosive growth of data and the mass storage state have brought many problems such ...
In this paper, we present a tree-partition algorithm for parallel mining of frequent patterns. Our w...
The k-means clustering method is one of the most widely used techniques in big data analytics. In th...
K-Means is a popular clustering algorithm which adopts an iterative refinement procedure to determin...
Cluster analysis is a generic term coined for procedures that are used objectively to group entities...
Processing power of pattern classification algorithms on conventional platforms has not been able to...
Clustering is defined as the grouping of similar items in a set, and is an important process within ...
Abstract. To cluster increasingly massive data sets that are common today in data and text mining, w...
A cluster is a collection of data objects that are similar to each other and dissimilar to the data ...
AbstractThe process of partitioning a large set of patterns into disjoint and homogeneous clusters i...
In this paper, we present a novel algorithm for performing k-means clustering. It organizes all the ...
Ankara : The Department of Computer Engineering and the Institute of Engineering and Science of Bilk...
One among the most influential and popular data mining methods is the k-Means algorithm for cluster ...
Clustering can be defined as the process of partitioning a set of patterns into disjoint and homoge...
Handling and processing of larger volume of data requires efficient data mining algorithms. k-means ...
At present, the explosive growth of data and the mass storage state have brought many problems such ...
In this paper, we present a tree-partition algorithm for parallel mining of frequent patterns. Our w...
The k-means clustering method is one of the most widely used techniques in big data analytics. In th...
K-Means is a popular clustering algorithm which adopts an iterative refinement procedure to determin...
Cluster analysis is a generic term coined for procedures that are used objectively to group entities...
Processing power of pattern classification algorithms on conventional platforms has not been able to...
Clustering is defined as the grouping of similar items in a set, and is an important process within ...
Abstract. To cluster increasingly massive data sets that are common today in data and text mining, w...
A cluster is a collection of data objects that are similar to each other and dissimilar to the data ...
AbstractThe process of partitioning a large set of patterns into disjoint and homogeneous clusters i...