In models that define probabilities via energies, maximum likelihood learning typically involves using Markov Chain Monte Carlo to sample from the model’s distribution. If the Markov chain is started at the data distribution, learning often works well even if the chain is only run for a few time steps [2]. But if the data distribution contains modes separated by regions of very low density, brief MCMC will not ensure that different modes have the correct relative energies because it cannot move particles from one mode to another. We show how to improve brief MCMC by allowing long-range moves that are suggested by the data distribution. If the model is approximately correct, these long-range moves have a reasonable acceptance rate.
Abstract. When dealing with datasets containing a billion instances or with sim-ulations that requir...
Abstract. When dealing with datasets containing a billion instances or with sim-ulations that requir...
Exponential-family models for dependent data have applications in a wide variety of areas, but the d...
In models that define probabilities via energies, maximum likelihood learning typically involves us...
In machine learning and statistics, probabilistic inference involving multimodal distributions is qu...
Maximum-likelihood (ML) learning of Markov random fields is challenging because it requires estima...
In machine learning and statistics, probabilistic inference involving multimodal distributions is qu...
A and B Comparing single amino acid frequencies (left), pairwise amino acid frequencies (center) and...
Learning Markov random field (MRF) models is notoriously hard due to the presence of a global norm...
We develop a method to combine Markov chain Monte Carlo (MCMC) and variational inference (VI), lever...
This study investigates the effects of Markov chain Monte Carlo (MCMC) sampling in unsupervised Maxi...
We propose a new class of learning algorithms that combines variational approximation and Markov cha...
In case of a non-linear relation linking the model to the data, numerical Markov Chain Monte Carlo (...
For a given Markov chain Monte Carlo algorithm we introduce a distance between two configurations th...
AbstractCarefully injected noise can speed the average convergence of Markov chain Monte Carlo (MCMC...
Abstract. When dealing with datasets containing a billion instances or with sim-ulations that requir...
Abstract. When dealing with datasets containing a billion instances or with sim-ulations that requir...
Exponential-family models for dependent data have applications in a wide variety of areas, but the d...
In models that define probabilities via energies, maximum likelihood learning typically involves us...
In machine learning and statistics, probabilistic inference involving multimodal distributions is qu...
Maximum-likelihood (ML) learning of Markov random fields is challenging because it requires estima...
In machine learning and statistics, probabilistic inference involving multimodal distributions is qu...
A and B Comparing single amino acid frequencies (left), pairwise amino acid frequencies (center) and...
Learning Markov random field (MRF) models is notoriously hard due to the presence of a global norm...
We develop a method to combine Markov chain Monte Carlo (MCMC) and variational inference (VI), lever...
This study investigates the effects of Markov chain Monte Carlo (MCMC) sampling in unsupervised Maxi...
We propose a new class of learning algorithms that combines variational approximation and Markov cha...
In case of a non-linear relation linking the model to the data, numerical Markov Chain Monte Carlo (...
For a given Markov chain Monte Carlo algorithm we introduce a distance between two configurations th...
AbstractCarefully injected noise can speed the average convergence of Markov chain Monte Carlo (MCMC...
Abstract. When dealing with datasets containing a billion instances or with sim-ulations that requir...
Abstract. When dealing with datasets containing a billion instances or with sim-ulations that requir...
Exponential-family models for dependent data have applications in a wide variety of areas, but the d...