We introduce the Pitman Yor Diffusion Tree (PYDT) for hierarchical clustering, a generalization of the Dirichlet Diffusion Tree (Neal, 2001) which removes the restriction to binary branching structure. The generative process is described and shown to result in an exchangeable distribution over data points. We prove some theoretical properties of the model and then present two inference methods: a collapsed MCMC sampler which allows us to model uncertainty over tree structures, and a computationally efficient greedy Bayesian EM search algorithm. Both algorithms use message passing on the tree structure. The utility of the model and algorithms is demonstrated on synthetic and real world data, both continuous and binary
This paper develops a new method for hierarchical clustering. Unlike other existing clustering schem...
International audienceIn this paper, we introduce a two step methodology to extract a hierarchical c...
Hidden tree Markov models allow learning distributions for tree structured data while being interpre...
In this paper we introduce the Pitman Yor Diffusion Tree (PYDT), a Bayesian non-parametric prior ove...
Abstract—In this paper we introduce the Pitman Yor Diffusion Tree (PYDT), a Bayesian non-parametric ...
We demonstrate efficient approximate infer-ence for the Dirichlet Diffusion Tree (Neal, 2003), a Bay...
We define the beta diffusion tree, a random tree structure with a set of leaves that defines a colle...
We define the beta diffusion tree, a random tree structure with a set of leaves that defines a colle...
We define the beta diffusion tree, a random tree structure with a set of leaves that defines a colle...
We present a novel algorithm for agglomerative hierarchical clustering based on evaluating marginal ...
Many data are naturally modeled by an unobserved hierarchical structure. In this paper we propose a ...
We define the beta diffusion tree, a random tree structure with a set of leaves that defines a colle...
Discovering hierarchical regularities in data is a key problem in interacting with large datasets, m...
Discovering hierarchical regularities in data is a key problem in interacting with large datasets, m...
This paper develops a new method for hierarchical clustering. Unlike other existing clustering schem...
This paper develops a new method for hierarchical clustering. Unlike other existing clustering schem...
International audienceIn this paper, we introduce a two step methodology to extract a hierarchical c...
Hidden tree Markov models allow learning distributions for tree structured data while being interpre...
In this paper we introduce the Pitman Yor Diffusion Tree (PYDT), a Bayesian non-parametric prior ove...
Abstract—In this paper we introduce the Pitman Yor Diffusion Tree (PYDT), a Bayesian non-parametric ...
We demonstrate efficient approximate infer-ence for the Dirichlet Diffusion Tree (Neal, 2003), a Bay...
We define the beta diffusion tree, a random tree structure with a set of leaves that defines a colle...
We define the beta diffusion tree, a random tree structure with a set of leaves that defines a colle...
We define the beta diffusion tree, a random tree structure with a set of leaves that defines a colle...
We present a novel algorithm for agglomerative hierarchical clustering based on evaluating marginal ...
Many data are naturally modeled by an unobserved hierarchical structure. In this paper we propose a ...
We define the beta diffusion tree, a random tree structure with a set of leaves that defines a colle...
Discovering hierarchical regularities in data is a key problem in interacting with large datasets, m...
Discovering hierarchical regularities in data is a key problem in interacting with large datasets, m...
This paper develops a new method for hierarchical clustering. Unlike other existing clustering schem...
This paper develops a new method for hierarchical clustering. Unlike other existing clustering schem...
International audienceIn this paper, we introduce a two step methodology to extract a hierarchical c...
Hidden tree Markov models allow learning distributions for tree structured data while being interpre...