Learning and policy search in stochastic dynamical systems with Bayesian neural networks

Depeweg, S
Hernández-Lobato, JM
Doshi-Velez, F
Udluft, S
,

Publication date

January 2017

Abstract

We present an algorithm for policy search in stochastic dynamical systems using model-based reinforcement learning. The system dynamics are described with Bayesian neural networks (BNNs) that include stochastic input variables. These input variables allow us to capture complex statistical patterns in the transition dynamics (e.g. multi-modality and heteroskedasticity), which are usually missed by alternative modeling approaches. After learning the dynamics, our BNNs are then fed into an algorithm that performs random roll-outs and uses stochastic optimization for policy learning. We train our BNNs by minimizing a-divergences with a = 0.5, which usually produces better results than other techniques such as variational Bayes. We illustrate th...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Learning and policy search in stochastic dynamical systems with Bayesian neural networks

Abstract

Extracted data

Learning and policy search in stochastic dynamical systems with Bayesian neural networks

Abstract

Extracted data

Related items

Related items