Discrete expectations arise in various machine learning tasks, and we often need to backpropagate the gradient through them. One domain is variational inference, where training discrete latent variable models requires gradient estimates of a high-dimensional discrete distribution, because we are backpropagating through a discrete stochastic layer in a deep neural network. Another important area of research is permutation- or ranking-based objectives, where the objective itself is discrete and non-differentiable. To tackle these problems, we propose ARMS, an antithetic REINFORCE-based Monte Carlo gradient estimator for three different discrete distributions: binary, categorical, and Plackett-Luce, where the last two are generalizations of the pr...
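The REINFORCE (score-function) estimator with antithetic sampling that this abstract builds on can be sketched for a single Bernoulli variable; the coupling via the pair (u, 1-u) and all names below are illustrative assumptions, not the ARMS construction itself.

```python
import numpy as np

def reinforce_antithetic(f, logit, n_pairs=1000, rng=None):
    """Score-function (REINFORCE) estimate of d/d(logit) E_{b ~ Bernoulli(p)}[f(b)],
    p = sigmoid(logit), using antithetic uniform pairs (u, 1 - u) so that each
    pair of samples is negatively correlated, which reduces variance.
    Illustrative sketch only, not the ARMS estimator."""
    rng = np.random.default_rng() if rng is None else rng
    p = 1.0 / (1.0 + np.exp(-logit))
    u = rng.uniform(size=n_pairs)
    grads = []
    for ui in np.concatenate([u, 1.0 - u]):  # each u paired with its antithesis
        b = float(ui < p)                    # Bernoulli sample via inverse CDF
        # score: d/d(logit) log Bernoulli(b; sigmoid(logit)) = b - p
        grads.append(f(b) * (b - p))
    return np.mean(grads)
```

For f(b) = b the true gradient is p(1 - p); at logit = 0 the antithetic pairing makes every pair of samples contribute exactly the true value 0.25, illustrating the variance reduction.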
This thesis presents five contributions to machine learning, with themes of differentiability and Ba...
Modern machine learning models are complex, hierarchical, and large-scale and are trained using non-...
Uncertainty estimation in large deep-learning models is a computationally challenging task, where it...
Gradient estimation is often necessary for fitting generative models with discrete latent variables,...
By enabling correct differentiation in Stochastic Computation Graphs (SCGs), the infinitely differen...
Gradient estimation -- approximating the gradient of an expectation with respect to the parameters o...
The integration of discrete algorithmic components in deep learning architectures has numerous appli...
Optimization with noisy gradients has become ubiquitous in statistics and machine learning. Reparame...
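The reparameterization (pathwise) idea that the abstract above refers to can be sketched for a Gaussian latent variable; the Gaussian example and all names are illustrative assumptions.

```python
import numpy as np

def reparam_grad_mu(grad_f, mu, sigma, n=10_000, rng=None):
    """Pathwise (reparameterization) estimate of d/d(mu) E_{z ~ N(mu, sigma^2)}[f(z)].
    Writing z = mu + sigma * eps with eps ~ N(0, 1) moves the parameter out of the
    sampling distribution, so the gradient becomes E[f'(mu + sigma * eps)].
    Illustrative sketch only."""
    rng = np.random.default_rng() if rng is None else rng
    eps = rng.standard_normal(n)
    return np.mean(grad_f(mu + sigma * eps))
```

For f(z) = z^2 we have E[f(z)] = mu^2 + sigma^2, so the true gradient is 2 * mu, which the estimate approaches as n grows.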
Learning models with discrete latent variables using stochastic gradient descent remains a challenge...
Stochastic gradient-based optimisation for discrete latent variable models is challenging due to the...
Policy gradient methods are reinforcement learning algorithms that adapt a parameterized policy by f...
How can we perform efficient inference and learning in directed probabilistic models, in the presenc...