<p>The Pitman-Yor process provides an elegant way to cluster data that exhibit power law behavior, where the number of clusters is unknown or unbounded. Unfortunately, inference in PitmanYor process-based models is typically slow and does not scale well with dataset size. In this paper we present new auxiliary-variable representations for the Pitman-Yor process and a special case of the hierarchical Pitman-Yor process that allows us to develop parallel inference algorithms that distribute inference both on the data space and the model space. We show that our method scales well with increasing data while avoiding any degradation in estimate quality</p
Automatic decision making and pattern recognition under uncertainty are difficult tasks that are ubi...
Multiple time series data may exhibit clustering over time and the clustering effect may change acro...
Bayesian nonparametric based models are an elegant way for discovering underlying latent features wi...
The Pitman-Yor process provides an elegant way to cluster data that exhibit power law behavior, wher...
<p>Nonparametric mixture models based on the Dirichlet process are an elegant alternative to finite ...
Multiple time series data may exhibit clustering over time and the clustering effect may change acro...
One of the major concerns for Markov Chain Monte Carlo (MCMC) algorithms is that they can take a lon...
Typically, parallel algorithms are developed to leverage the processing power of multiple processors...
In many situations it is important to be able to propose N independent realizations of a given distr...
Multiple time series data may exhibit clustering over time and the clustering effect may change acro...
Multiple time series data may exhibit clustering over time and the clustering effect may change acro...
Accelerating Markov chain Monte Carlo via parallel predictive prefetching We present a general frame...
In many situations it is important to be able to propose N independent realizations of a given distr...
Communication costs, resulting from synchronization requirements during learning, can greatly slow d...
Abstract. In many situations it is important to be able to propose N independent realizations of a g...
Automatic decision making and pattern recognition under uncertainty are difficult tasks that are ubi...
Multiple time series data may exhibit clustering over time and the clustering effect may change acro...
Bayesian nonparametric based models are an elegant way for discovering underlying latent features wi...
The Pitman-Yor process provides an elegant way to cluster data that exhibit power law behavior, wher...
<p>Nonparametric mixture models based on the Dirichlet process are an elegant alternative to finite ...
Multiple time series data may exhibit clustering over time and the clustering effect may change acro...
One of the major concerns for Markov Chain Monte Carlo (MCMC) algorithms is that they can take a lon...
Typically, parallel algorithms are developed to leverage the processing power of multiple processors...
In many situations it is important to be able to propose N independent realizations of a given distr...
Multiple time series data may exhibit clustering over time and the clustering effect may change acro...
Multiple time series data may exhibit clustering over time and the clustering effect may change acro...
Accelerating Markov chain Monte Carlo via parallel predictive prefetching We present a general frame...
In many situations it is important to be able to propose N independent realizations of a given distr...
Communication costs, resulting from synchronization requirements during learning, can greatly slow d...
Abstract. In many situations it is important to be able to propose N independent realizations of a g...
Automatic decision making and pattern recognition under uncertainty are difficult tasks that are ubi...
Multiple time series data may exhibit clustering over time and the clustering effect may change acro...
Bayesian nonparametric based models are an elegant way for discovering underlying latent features wi...