BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs

Katt, Sammie (author)
Nguyen, Hai (author)
Oliehoek, F.A. (author)
Amato, Christopher (author)

Publication date

January 2022

Publisher

International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)

Abstract

While reinforcement learning (RL) has made great advances in scalability, exploration and partial observability are still active research topics. In contrast, Bayesian RL (BRL) provides a principled answer to both state estimation and the exploration-exploitation trade-off, but struggles to scale. To tackle this challenge, BRL frameworks with various prior assumptions have been proposed, with varied success. This work presents a representation-agnostic formulation of BRL under partially observability, unifying the previous models under one theoretical umbrella. To demonstrate its practical significance we also propose a novel derivation, Bayes-Adaptive Deep Dropout rl (BADDr), based on dropout networks. Under this parameterization, in contr...

Extracted data

We use cookies to provide a better user experience.

Data Protection

BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs

Abstract

Extracted data

BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs

Abstract

Extracted data

Related items

Related items