Training neural networks with discrete stochastic variables presents a unique challenge. Backpropagation is not directly applicable, nor are the reparameterization tricks used in networks with continuous stochastic variables. To address this challenge, we present Hindsight Network Credit Assignment (HNCA), a novel gradient estimation algorithm for networks of discrete stochastic units. HNCA works by assigning credit to each unit based on the degree to which its output influences its immediate children in the network. We prove that HNCA produces unbiased gradient estimates with reduced variance compared to the REINFORCE estimator, while the computational cost is similar to that of backpropagation. We first apply HNCA in a contextual bandit setting.
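For reference, the following is a minimal sketch (not taken from the paper) of the REINFORCE, or score-function, estimator for a single Bernoulli stochastic unit, the baseline estimator that HNCA is compared against. The toy reward, variable names, and learning-rate choice are illustrative assumptions, not details of the authors' method.

```python
import numpy as np

rng = np.random.default_rng(0)

def reinforce_grad(theta, x, reward_fn):
    """One-sample REINFORCE estimate of d E[reward] / d theta.

    The unit fires b ~ Bernoulli(p) with p = sigmoid(theta . x); the
    estimator is reward * d log Pr(b | x; theta) / d theta.
    This is the high-variance baseline that HNCA aims to improve on.
    """
    p = 1.0 / (1.0 + np.exp(-x @ theta))   # firing probability of the unit
    b = float(rng.random() < p)            # sample the discrete output
    r = reward_fn(b)                       # reward depends only on the sample
    grad_logp = (b - p) * x                # gradient of the Bernoulli log-likelihood
    return r * grad_logp

# Toy usage: reward 1 when the unit fires, so the estimate should push p toward 1.
x = np.array([1.0, 0.5])
theta = np.zeros(2)
for _ in range(200):
    theta += 0.1 * reinforce_grad(theta, x, reward_fn=lambda b: b)
print("learned firing probability:", 1.0 / (1.0 + np.exp(-x @ theta)))
```

Because the reward multiplies the entire score function, single-sample estimates of this form can have high variance; HNCA's credit assignment through a unit's immediate children is presented as a way to reduce that variance without introducing bias.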