Much of the recent success in deep reinforcement learning has been based on minimizing the squared Bellman error. However, training is often unstable due to fast-changing target Q-values, and target networks are employed to regularize the Q-value estimation and stabilize training by using an additional set of lagging parameters. Despite their advantages, target networks are a potentially inflexible way to regularize Q-values, which may ultimately slow down training. In this work, we address this issue by augmenting the squared Bellman error with a functional regularizer. Unlike target networks, the regularization we propose here is explicit and enables us to use up-to-date parameters as well as control the regularization. This leads to a...
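The idea in the abstract above can be sketched as a loss that keeps the up-to-date parameters in the bootstrapped target and adds an explicit penalty in their place. This is a minimal illustrative sketch, not the paper's exact formulation: the quadratic penalty toward a lagging snapshot `q_prev` and the coefficient `kappa` are assumptions standing in for the (unspecified) functional regularizer.

```python
import numpy as np

def regularized_bellman_loss(q, q_prev, rewards, q_next_max,
                             gamma=0.99, kappa=1.0):
    """Squared Bellman error plus an explicit functional regularizer.

    The bootstrapped targets use the up-to-date value estimates
    `q_next_max` (no target network), while `kappa` controls how
    strongly the current Q-values are pulled toward a lagging
    snapshot `q_prev`. All names and the penalty form are
    illustrative assumptions.
    """
    targets = rewards + gamma * q_next_max   # bootstrapped targets
    bellman = np.mean((q - targets) ** 2)    # squared Bellman error
    penalty = np.mean((q - q_prev) ** 2)     # explicit regularizer
    return bellman + kappa * penalty
```

Setting `kappa = 0` recovers the plain squared Bellman error, so the strength of the regularization is tunable rather than fixed by a target-network update schedule.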
Adversarial training has been shown to regularize deep neural networks in addition to increasing the...
Deep learning represents a powerful set of techniques for profiling sidechannel analysis. The result...
In reinforcement learning, Q-learning is the best-known algorithm, but it suffers from overestimation...
Neural networks allow Q-learning reinforcement learning agents such as deep Q-networks (DQN) to appr...
$Q$-learning with function approximation is one of the most empirically successful while theoretical...
The Q-learning algorithm is known to be affected by the maximization bias, i.e. the systematic overe...
The popular deep Q-learning algorithm is known to be unstable because of oscillating Q-values and ...
The deadly triad refers to the instability of a reinforcement learning algorithm when it employs off...
Using deep neural nets as function approximators for reinforcement learning tasks has r...
In the past decade, machine learning strategies centered on the use of Deep Neural Networks (DNNs) h...
Despite powerful representation ability, deep neural networks (DNNs) are prone to over-fitting, beca...
The popular Q-learning algorithm is known to overestimate action values under certain conditions. It...
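The overestimation mentioned above arises from evaluating the same noisy maximum that was used to select the action. A minimal sketch of the decoupled target used by double Q-learning, with illustrative names rather than the paper's code:

```python
import numpy as np

def double_q_target(q_a, q_b, reward, gamma=0.99):
    """Double Q-learning target for one transition.

    One estimator (q_a) selects the greedy action; the other (q_b)
    evaluates it. Decoupling selection from evaluation damps the
    upward bias of taking a max over noisy value estimates.
    """
    a_star = int(np.argmax(q_a))          # selection with estimator A
    return reward + gamma * q_b[a_star]   # evaluation with estimator B
```

By contrast, the standard target `reward + gamma * q_a.max()` both selects and evaluates with the same estimator, which is the source of the bias.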
This is the version of record. It originally appeared on arXiv at http://arxiv.org/abs/1603.00748. Mo...
Deep reinforcement learning techniques have demonstrated superior performance in a wide variety of e...
This thesis tests the hypothesis that distributional deep reinforcement learning (RL) algorithms get...