In an age of increasingly large data sets, investigators in many different disciplines have turned to clustering as a tool for data analysis and exploration. Existing clustering methods, however, typically depend on several nontrivial assumptions about the structure of data. Here, we reformulate the clustering problem from an information theoretic perspective that avoids many of these assumptions. In particular, our formulation obviates the need for defining a cluster “prototype,” does not require an a priori similarity metric, is invariant to changes in the representation of the data, and naturally captures nonlinear relations. We apply this approach to different domains and find that it consistently produces clusters that are more coheren...
We present a conceptually simple method for hierarchical clustering of data called mutual informatio...
Numerous papers ask how difficult it is to cluster data. We suggest that the more relevant and inter...
In this work we propose a new information-theoretic clustering algorithm that infers cluster members...
We propose a novel method for clustering data which is grounded in information-theoretic prin-ciples...
We propose a novel method for clustering data which is grounded in information-theoretic prin-ciples...
Discovery of alternative clusterings is an important method for exploring complex datasets. It provi...
Discovery of alternative clusterings is an important method for exploring complex datasets. It provi...
How do we find a natural clustering of a real world point set, which contains an unknown number of c...
How do we find a natural clustering of a real world point set, which contains an unknown number of c...
This paper proposes an information theoretic criterion for comparing two partitions, or clusterings,...
Clustering is part of data mining where data mining is a process in which it is used to analyze data...
[[abstract]]An efficient clustering algorithm is proposed in an unsupervised manner to cluster the g...
This thesis addresses selected aspects of cluster analysis, mainly for microarray data, that include...
Numerous papers ask how difficult it is to cluster data. We suggest that the more relevant and inter...
Numerous papers ask how difficult it is to cluster data. We suggest that the more relevant and inter...
We present a conceptually simple method for hierarchical clustering of data called mutual informatio...
Numerous papers ask how difficult it is to cluster data. We suggest that the more relevant and inter...
In this work we propose a new information-theoretic clustering algorithm that infers cluster members...
We propose a novel method for clustering data which is grounded in information-theoretic prin-ciples...
We propose a novel method for clustering data which is grounded in information-theoretic prin-ciples...
Discovery of alternative clusterings is an important method for exploring complex datasets. It provi...
Discovery of alternative clusterings is an important method for exploring complex datasets. It provi...
How do we find a natural clustering of a real world point set, which contains an unknown number of c...
How do we find a natural clustering of a real world point set, which contains an unknown number of c...
This paper proposes an information theoretic criterion for comparing two partitions, or clusterings,...
Clustering is part of data mining where data mining is a process in which it is used to analyze data...
[[abstract]]An efficient clustering algorithm is proposed in an unsupervised manner to cluster the g...
This thesis addresses selected aspects of cluster analysis, mainly for microarray data, that include...
Numerous papers ask how difficult it is to cluster data. We suggest that the more relevant and inter...
Numerous papers ask how difficult it is to cluster data. We suggest that the more relevant and inter...
We present a conceptually simple method for hierarchical clustering of data called mutual informatio...
Numerous papers ask how difficult it is to cluster data. We suggest that the more relevant and inter...
In this work we propose a new information-theoretic clustering algorithm that infers cluster members...