A useful definition of ‘big data’ is data that is too big to process comfortably on a single machine, either because of processor, memory, or disk bottlenecks. Graphics processing units can alleviate the processor bottleneck, but memory or disk bottlenecks can only be eliminated by splitting data across multiple machines. Communication between large numbers of machines is expensive (regardless of the amount of data being communicated), so there is a need for algorithms that perform distributed approximate Bayesian analyses with minimal communication. Consensus Monte Carlo operates by running a separate Monte Carlo algorithm on each machine, and then averaging individual Monte Carlo draws across machines. Depending on the model, the resultin...
Emerging many-core computer architectures provide an incentive for computational methods to exhibit ...
Bayesian methods provide the means for studying probabilistic models of linear as well as non-linear...
Markov Chain Monte Carlo (MCMC) methods are fundamental tools for sampling highly complex distributi...
To conduct Bayesian inference with large data sets, it is often convenient or necessary to distribut...
This paper proposes a simple, practical and efficient MCMC algorithm for Bayesian analysis of big da...
This paper studies distributed Bayesian learning in a setting encompassing a central server and mult...
We propose a general method for distributed Bayesian model choice, using the marginal likelihood, wh...
Divide-and-conquer strategies for Monte Carlo algorithms are an increasingly popular approach to mak...
<p>Many modern applications fall into the category of "large-scale" statistical problems, in which b...
Markov chain Monte Carlo (MCMC) methods have proven to be a very powerful tool for analyzing data of...
Multilevel models are extremely useful in handling large hierarchical datasets. However, computation...
Computational intensity and sequential nature of estimation techniques for Bayesian methods in stati...
Bayesian statistics has emerged as a leading paradigm for the analysis of complicated datasets and f...
There has recently been considerable interest in addressing the problem of unifying distributed stat...
This thesis focuses on sources of error in modern Bayesian analysis and machine learning in the ``bi...
Emerging many-core computer architectures provide an incentive for computational methods to exhibit ...
Bayesian methods provide the means for studying probabilistic models of linear as well as non-linear...
Markov Chain Monte Carlo (MCMC) methods are fundamental tools for sampling highly complex distributi...
To conduct Bayesian inference with large data sets, it is often convenient or necessary to distribut...
This paper proposes a simple, practical and efficient MCMC algorithm for Bayesian analysis of big da...
This paper studies distributed Bayesian learning in a setting encompassing a central server and mult...
We propose a general method for distributed Bayesian model choice, using the marginal likelihood, wh...
Divide-and-conquer strategies for Monte Carlo algorithms are an increasingly popular approach to mak...
<p>Many modern applications fall into the category of "large-scale" statistical problems, in which b...
Markov chain Monte Carlo (MCMC) methods have proven to be a very powerful tool for analyzing data of...
Multilevel models are extremely useful in handling large hierarchical datasets. However, computation...
Computational intensity and sequential nature of estimation techniques for Bayesian methods in stati...
Bayesian statistics has emerged as a leading paradigm for the analysis of complicated datasets and f...
There has recently been considerable interest in addressing the problem of unifying distributed stat...
This thesis focuses on sources of error in modern Bayesian analysis and machine learning in the ``bi...
Emerging many-core computer architectures provide an incentive for computational methods to exhibit ...
Bayesian methods provide the means for studying probabilistic models of linear as well as non-linear...
Markov Chain Monte Carlo (MCMC) methods are fundamental tools for sampling highly complex distributi...