Gradient estimation -- approximating the gradient of an expectation with respect to the parameters of a distribution -- is central to the solution of many machine learning problems. However, when the distribution is discrete, most common gradient estimators suffer from excessive variance. To improve the quality of gradient estimation, we introduce a variance reduction technique based on Stein operators for discrete distributions. We then use this technique to build flexible control variates for the REINFORCE leave-one-out estimator. Our control variates can be adapted online to minimize variance and do not require extra evaluations of the target function. In benchmark generative modeling tasks such as training binary variational autoencoder...
Countless signal processing applications include the reconstruction of signals from few indirect lin...
In this paper we analyze different schemes for obtaining gradient estimates when the underlying func...
We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Mark...
Gradient estimation is often necessary for fitting generative models with discrete latent variables,...
Stochastic gradient optimization is a class of widely used algorithms for training machine learning ...
<p>Stochastic gradient optimization is a class of widely used algorithms for training machine learni...
Optimization with noisy gradients has become ubiquitous in statistics and machine learning. Reparame...
Discrete expectations arise in various machine learning tasks, and we often need to backpropagate th...
We analyse the properties of an unbiased gradient estimator of the evidence lower bound (ELBO) for ...
The integration of discrete algorithmic components in deep learning architectures has numerous appli...
237 pagesIt seems that in the current age, computers, computation, and data have an increasingly imp...
Stochastic gradient-based optimisation for discrete latent variable models is challenging due to the...
Countless signal processing applications include the reconstruction of signals from few indirect lin...
The field of statistical machine learning has seen a rapid progress in complex hierarchical Bayesian...
International audienceAlgorithms to solve variational regularization of ill-posed inverse problems u...
Countless signal processing applications include the reconstruction of signals from few indirect lin...
In this paper we analyze different schemes for obtaining gradient estimates when the underlying func...
We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Mark...
Gradient estimation is often necessary for fitting generative models with discrete latent variables,...
Stochastic gradient optimization is a class of widely used algorithms for training machine learning ...
<p>Stochastic gradient optimization is a class of widely used algorithms for training machine learni...
Optimization with noisy gradients has become ubiquitous in statistics and machine learning. Reparame...
Discrete expectations arise in various machine learning tasks, and we often need to backpropagate th...
We analyse the properties of an unbiased gradient estimator of the evidence lower bound (ELBO) for ...
The integration of discrete algorithmic components in deep learning architectures has numerous appli...
237 pagesIt seems that in the current age, computers, computation, and data have an increasingly imp...
Stochastic gradient-based optimisation for discrete latent variable models is challenging due to the...
Countless signal processing applications include the reconstruction of signals from few indirect lin...
The field of statistical machine learning has seen a rapid progress in complex hierarchical Bayesian...
International audienceAlgorithms to solve variational regularization of ill-posed inverse problems u...
Countless signal processing applications include the reconstruction of signals from few indirect lin...
In this paper we analyze different schemes for obtaining gradient estimates when the underlying func...
We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Mark...