We present an algorithm for policy search in stochastic dynamical systems using model-based reinforcement learning. The system dynamics are described with Bayesian neural networks (BNNs) that include stochastic input variables. These input variables allow us to capture complex statistical patterns in the transition dynamics (e.g. multi-modality and heteroskedasticity), which are usually missed by alternative modeling approaches. After learning the dynamics, our BNNs are then fed into an algorithm that performs random roll-outs and uses stochastic optimization for policy learning. We train our BNNs by minimizing a-divergences with a = 0.5, which usually produces better results than other techniques such as variational Bayes. We illustrate th...
Deep Learning-based models are becoming more and more relevant for an increasing number of applicati...
Recently, Bayesian Optimization (BO) has been used to successfully optimize parametric policies in s...
Application of learning algorithms to robotics and control problems with highly nonlinear dynamics t...
We present an algorithm for policy search in stochastic dynamical systems using model-based reinforc...
We present an algorithm for policy search in stochastic dynamical systems using model-based reinforc...
© ICLR 2019 - Conference Track Proceedings. All rights reserved. We present an algorithm for policy ...
Policy search algorithms have facilitated application of Reinforcement Learning (RL) to dynamic syst...
Policy search algorithms have facilitated application of Reinforcement Learning (RL) to dynamic syst...
A probabilistic reinforcement learning algorithm is presented for finding control policies in contin...
peer reviewedBayesian Reinforcement Learning (BRL) agents aim to maximise the expected collected rew...
Abstract: Reinforcement learning (RL) is a kind of machine learning. It aims to optimize agents ’ po...
We present a policy search method that uses iteratively refitted local linear models to optimize tra...
We present a policy search method that uses iteratively refitted local linear models to optimize tra...
We present a policy search method that uses iteratively refitted local linear models to optimize tra...
Applications to learn control of unfamiliar dynamical systems with increasing autonomy are ubiquitou...
Deep Learning-based models are becoming more and more relevant for an increasing number of applicati...
Recently, Bayesian Optimization (BO) has been used to successfully optimize parametric policies in s...
Application of learning algorithms to robotics and control problems with highly nonlinear dynamics t...
We present an algorithm for policy search in stochastic dynamical systems using model-based reinforc...
We present an algorithm for policy search in stochastic dynamical systems using model-based reinforc...
© ICLR 2019 - Conference Track Proceedings. All rights reserved. We present an algorithm for policy ...
Policy search algorithms have facilitated application of Reinforcement Learning (RL) to dynamic syst...
Policy search algorithms have facilitated application of Reinforcement Learning (RL) to dynamic syst...
A probabilistic reinforcement learning algorithm is presented for finding control policies in contin...
peer reviewedBayesian Reinforcement Learning (BRL) agents aim to maximise the expected collected rew...
Abstract: Reinforcement learning (RL) is a kind of machine learning. It aims to optimize agents ’ po...
We present a policy search method that uses iteratively refitted local linear models to optimize tra...
We present a policy search method that uses iteratively refitted local linear models to optimize tra...
We present a policy search method that uses iteratively refitted local linear models to optimize tra...
Applications to learn control of unfamiliar dynamical systems with increasing autonomy are ubiquitou...
Deep Learning-based models are becoming more and more relevant for an increasing number of applicati...
Recently, Bayesian Optimization (BO) has been used to successfully optimize parametric policies in s...
Application of learning algorithms to robotics and control problems with highly nonlinear dynamics t...