Most clustering algorithms assume that all dimensions of the data can be described by a single structure. Cross-clustering (or multi-view clustering) allows multiple structures, each applying to a subset of the dimen-sions. We present a novel approach to cross-clustering, based on approximating the so-lution to a Cross Dirichlet Process mixture (CDPM) model [Shafto et al., 2006, Mans-inghka et al., 2009]. Our bottom-up, de-terministic approach results in a hierarchi-cal clustering of dimensions, and at each node, a hierarchical clustering of data points. We also present a randomized approxima-tion, based on a truncated hierarchy, that scales linearly in the number of levels. Re-sults on synthetic and real-world data sets demonstrate that th...
One of the most important goals of unsupervised learning is to discover meaningful clusters in data....
<div><p>The use of mutual information as a similarity measure in agglomerative hierarchical clusteri...
The use of mutual information as a similarity measure in agglomerative hierarchical clustering (AHC)...
We present a novel algorithm for agglomerative hierarchical clustering based on evaluating marginal ...
Four of the most common limitations of the many available clustering methods are: i) the lack of a p...
<div><p>Four of the most common limitations of the many available clustering methods are: i) the lac...
Four of the most common limitations of the many available clustering methods are: i) the lack of a p...
Four of the most common limitations of the many available clustering methods are: i) the lack of a p...
We propose the hierarchical Dirichlet process (HDP), a nonparametric Bayesian model for clustering ...
Description The method performs bottom-up hierarchical clustering, using a Dirichlet Process (infi-n...
Abstract We introduce a new Bayesian model for hierarchical clustering based on a priorover trees ca...
<p>I consider the problem of clustering multiple related groups of data. My approach entails mixtur...
Recent advances in high throughput methodologies offer researchers the ability to understand com-ple...
<p>Clustering methods are designed to separate heterogeneous data into groups of similar objects suc...
In the Bayesian nonparametric family, Dirichlet Process (DP) is a prior distribution that is able to...
One of the most important goals of unsupervised learning is to discover meaningful clusters in data....
<div><p>The use of mutual information as a similarity measure in agglomerative hierarchical clusteri...
The use of mutual information as a similarity measure in agglomerative hierarchical clustering (AHC)...
We present a novel algorithm for agglomerative hierarchical clustering based on evaluating marginal ...
Four of the most common limitations of the many available clustering methods are: i) the lack of a p...
<div><p>Four of the most common limitations of the many available clustering methods are: i) the lac...
Four of the most common limitations of the many available clustering methods are: i) the lack of a p...
Four of the most common limitations of the many available clustering methods are: i) the lack of a p...
We propose the hierarchical Dirichlet process (HDP), a nonparametric Bayesian model for clustering ...
Description The method performs bottom-up hierarchical clustering, using a Dirichlet Process (infi-n...
Abstract We introduce a new Bayesian model for hierarchical clustering based on a priorover trees ca...
<p>I consider the problem of clustering multiple related groups of data. My approach entails mixtur...
Recent advances in high throughput methodologies offer researchers the ability to understand com-ple...
<p>Clustering methods are designed to separate heterogeneous data into groups of similar objects suc...
In the Bayesian nonparametric family, Dirichlet Process (DP) is a prior distribution that is able to...
One of the most important goals of unsupervised learning is to discover meaningful clusters in data....
<div><p>The use of mutual information as a similarity measure in agglomerative hierarchical clusteri...
The use of mutual information as a similarity measure in agglomerative hierarchical clustering (AHC)...