This chapter presents a unified framework to generalize a number of fuzzy clustering algorithms to handle distributed data in an exact way, i.e., with no approximation of results with respect to their original centralized versions. The same framework allows the exact distribution of relative validity indices used to evaluate the quality of fuzzy clustering solutions. Complexity analyses for each distributed algorithm and index are reported in terms of space, time, and communication aspects. A general procedure to estimate the number of clusters in a non-centralized fashion using the proposed framework is also described. Such a procedure is directly applicable not only to distributed data, but to parallel data processing scenarios as well. E...
Cluster analysis is a multivariate statistical classification method, implying different methods and...
Cluster validation is a major issue in cluster analysis. Many existing validity indices do not perfo...
Clustering has become one of the most widely used tasks in analyzing the vast amount of data. In clu...
This chapter presents a unified framework to generalize a number of fuzzy clustering algorithms to h...
We present a consensus-based algorithm to distributed fuzzy clustering that allows automatic estimat...
There are some variants of the widely used Fuzzy C-Means (FCM) algorithm that support clustering dat...
Since clustering is an unsupervised method and there is no a-priori indication for the actual number...
Clustering can be defined as the process of grouping physical or abstract objects into classes of si...
A hard partition clustering algorithm assigns equally distant points to one of the clusters, where e...
Fuzzy clustering is useful to mine complex and multi-dimensional datasets, where the members have pa...
The application of fuzzy cluster analysis to larger data sets can cause runtime and memory overflow ...
Because traditional fuzzy clustering validity indices need to specify the number of clusters and are...
Two well-known drawbacks in fuzzy clustering are the requirement of assigning in advance the number...
Abstract: Finding the optimal cluster number and validating the partition results of a data set are ...
Cluster analysis is an important tool in the exploration of large collections of data, revealing pat...
Cluster analysis is a multivariate statistical classification method, implying different methods and...
Cluster validation is a major issue in cluster analysis. Many existing validity indices do not perfo...
Clustering has become one of the most widely used tasks in analyzing the vast amount of data. In clu...
This chapter presents a unified framework to generalize a number of fuzzy clustering algorithms to h...
We present a consensus-based algorithm to distributed fuzzy clustering that allows automatic estimat...
There are some variants of the widely used Fuzzy C-Means (FCM) algorithm that support clustering dat...
Since clustering is an unsupervised method and there is no a-priori indication for the actual number...
Clustering can be defined as the process of grouping physical or abstract objects into classes of si...
A hard partition clustering algorithm assigns equally distant points to one of the clusters, where e...
Fuzzy clustering is useful to mine complex and multi-dimensional datasets, where the members have pa...
The application of fuzzy cluster analysis to larger data sets can cause runtime and memory overflow ...
Because traditional fuzzy clustering validity indices need to specify the number of clusters and are...
Two well-known drawbacks in fuzzy clustering are the requirement of assigning in advance the number...
Abstract: Finding the optimal cluster number and validating the partition results of a data set are ...
Cluster analysis is an important tool in the exploration of large collections of data, revealing pat...
Cluster analysis is a multivariate statistical classification method, implying different methods and...
Cluster validation is a major issue in cluster analysis. Many existing validity indices do not perfo...
Clustering has become one of the most widely used tasks in analyzing the vast amount of data. In clu...