Training models with discrete latent variables is challenging due to the high variance of unbiased gradient estimators. While low-variance reparameterization gradients of a continuous relaxation can provide an effective solution, a continuous relaxation is not always available or tractable. Dong et al. (2020) and Yin et al. (2020) introduced a performant estimator that does not rely on continuous relaxations; however, it is limited to binary random variables. We introduce a novel derivation of their estimator based on importance sampling and statistical couplings, which we extend to the categorical setting. Motivated by the construction of a stick-breaking coupling, we introduce gradient estimators based on reparameterizing categorical vari...
Policy-gradient methods in Reinforcement Learning(RL) are very universal and widely applied in pract...
peer reviewedIn this paper, we propose an extension to the policy gradient algorithms by allowing st...
Training Restricted Boltzmann Machines (RBMs) has been challenging for a long time due to the diffic...
Stochastic gradient-based optimisation for discrete latent variable models is challenging due to the...
Discrete expectations arise in various machine learning tasks, and we often need to backpropagate th...
Gradient estimation is often necessary for fitting generative models with discrete latent variables,...
Optimization with noisy gradients has become ubiquitous in statistics and machine learning. Reparame...
Learning models with discrete latent variables using stochastic gradient descent remains a challenge...
Policy gradient methods are a widely used class of model-free reinforcement learning algorithms wher...
Countless signal processing applications include the reconstruction of signals from few indirect lin...
We present the results of a simulation study performed to compare the accuracy of a lassotype penali...
Policy gradient methods are a widely used class of model-free reinforcement learning algorithms wher...
The pairwise objective paradigms are an important and essential aspect of machine learning. Examples...
Traditional approaches to variational inference rely on parametric families of variational distribut...
Despite the recent growth of theoretical studies and empirical successes of neural networks, gradien...
Policy-gradient methods in Reinforcement Learning(RL) are very universal and widely applied in pract...
peer reviewedIn this paper, we propose an extension to the policy gradient algorithms by allowing st...
Training Restricted Boltzmann Machines (RBMs) has been challenging for a long time due to the diffic...
Stochastic gradient-based optimisation for discrete latent variable models is challenging due to the...
Discrete expectations arise in various machine learning tasks, and we often need to backpropagate th...
Gradient estimation is often necessary for fitting generative models with discrete latent variables,...
Optimization with noisy gradients has become ubiquitous in statistics and machine learning. Reparame...
Learning models with discrete latent variables using stochastic gradient descent remains a challenge...
Policy gradient methods are a widely used class of model-free reinforcement learning algorithms wher...
Countless signal processing applications include the reconstruction of signals from few indirect lin...
We present the results of a simulation study performed to compare the accuracy of a lassotype penali...
Policy gradient methods are a widely used class of model-free reinforcement learning algorithms wher...
The pairwise objective paradigms are an important and essential aspect of machine learning. Examples...
Traditional approaches to variational inference rely on parametric families of variational distribut...
Despite the recent growth of theoretical studies and empirical successes of neural networks, gradien...
Policy-gradient methods in Reinforcement Learning(RL) are very universal and widely applied in pract...
peer reviewedIn this paper, we propose an extension to the policy gradient algorithms by allowing st...
Training Restricted Boltzmann Machines (RBMs) has been challenging for a long time due to the diffic...