Bayesian learning for policy search in trajectory control of a planar manipulator

Tavakol Aghaei, Vahid
Ağababaoğlu, Arda
Onat, Ahmet
Yıldırım, Sinan

Publication date

March 2019

DOI

10.1109/CCWC.2019.8666449

Publisher

Institute of Electrical and Electronics Engineers

Abstract

Applying learning algorithms to robotics and control problems with highly nonlinear dynamics, in order to obtain a plausible control policy in a continuous state space, is expected to greatly facilitate the design process. Recently, policy search methods in Reinforcement Learning (RL), such as policy gradient, have succeeded in coping with such complex systems. Nevertheless, they converge slowly and are prone to getting stuck in local optima. To alleviate this, a Bayesian inference method based on Markov Chain Monte Carlo (MCMC), utilizing a multiplicative reward function, is proposed. This study aims to compare eNAC, a popular gradient-based RL method, with the proposed Bayesian learning method, where the objective is trajectory control...
