Learning models with discrete latent variables using stochastic gradient descent remains a challenge due to the high variance of gradient estimates. Modern variance reduction techniques mostly consider categorical distributions and have limited applicability when the number of possible outcomes becomes large. In this work, we consider models with latent permutations and propose control variates for the Plackett-Luce distribution. In particular, the control variates allow us to optimize black-box functions over permutations using stochastic gradient descent. To illustrate the approach, we consider a variety of causal structure learning tasks for continuous and discrete data. We show that our method outperforms competitive relaxation-based op...
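The abstract does not spell out the proposed control variates, so the following is only a minimal sketch of the general setting it describes: estimating the gradient of an expected black-box function over permutations drawn from a Plackett-Luce distribution. It uses the plain score-function (REINFORCE) estimator with a generic leave-one-out baseline as a stand-in control variate, which is not the authors' construction. The function names, the log-score parameterization `theta`, and the toy objective are assumptions made for illustration.

```python
import numpy as np


def sample_plackett_luce(theta, rng):
    """Draw one permutation by sorting Gumbel-perturbed log-scores (descending)."""
    gumbel = rng.gumbel(size=theta.shape)
    return np.argsort(-(theta + gumbel))


def log_prob_and_grad(theta, perm):
    """Return log p(perm | theta) and its gradient with respect to theta."""
    n = len(theta)
    logp = 0.0
    grad = np.zeros(n)
    for i in range(n):
        remaining = perm[i:]              # items not yet placed at step i
        logits = theta[remaining]
        m = logits.max()
        lse = m + np.log(np.exp(logits - m).sum())  # stable log-sum-exp
        logp += theta[perm[i]] - lse
        grad[perm[i]] += 1.0                      # chosen item
        grad[remaining] -= np.exp(logits - lse)   # softmax over remaining items
    return logp, grad


def loo_score_function_gradient(f, theta, num_samples, rng):
    """Score-function estimate of d/dtheta E[f(perm)] with a leave-one-out baseline.

    Assumption: this generic baseline stands in for the paper's control variates.
    Requires num_samples >= 2.
    """
    perms = [sample_plackett_luce(theta, rng) for _ in range(num_samples)]
    values = np.array([f(p) for p in perms], dtype=float)
    grads = np.stack([log_prob_and_grad(theta, p)[1] for p in perms])
    # Baseline for sample k is the mean of f over the other samples.
    baselines = (values.sum() - values) / (num_samples - 1)
    return ((values - baselines)[:, None] * grads).mean(axis=0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    theta = np.zeros(4)

    # Hypothetical black-box objective: reward placing item 0 first.
    def toy_objective(perm):
        return float(perm[0] == 0)

    print(loo_score_function_gradient(toy_objective, theta, 64, rng))
```

Because each sample's baseline is computed from the other samples only, the estimator stays unbiased; the baseline affects the variance alone.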
Gradient estimation is often necessary for fitting generative models with discrete latent variables,...
Variational inference approximates the posterior distribution of a probabilistic model with a parame...
The field of statistical machine learning has seen rapid progress in complex hierarchical Bayesian...
Discrete expectations arise in various machine learning tasks, and we often need to backpropagate th...
Stochastic gradient optimization is a class of widely used algorithms for training machine learning ...
Stochastic gradient-based optimisation for discrete latent variable models is challenging due to the...
Variational inference has become a widely used method to approximate posteriors in complex latent va...
How can we perform efficient inference and learning in directed probabilistic models, in the presenc...
We develop an optimization algorithm suitable for Bayesian learning in complex models. Our approach ...
Training models with discrete latent variables is challenging due to the high variance of unbiased g...
This thesis focuses on the variational learning of latent Gaussian models for discrete data. The lea...