Applying Policy Gradient Reinforcement Learning to Optimise Robot Behaviours

Publication date

January 2010

Abstract

In robotics, elementary behaviour patterns often tackle control-theoretic problems. Because models of the control system are incomplete or imprecise, the structure and the parameters of a control policy are unknown. These problems can be solved by reinforcement learning algorithms such as policy gradient methods, which use gradient-based optimisation to find a local optimum in policy space with respect to a reward function. In this thesis, policy gradient learning is used to optimise a controller represented as a z-transformed rational function. This representation allows the control structure and its parameters to be optimised simultaneously in the time domain. The resulting controller can be analysed in terms of control theory to predict the...
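As a rough illustration of the idea described in the abstract, the sketch below represents a controller as a rational function in z^-1, evaluates it as a difference equation, and tunes its coefficients by gradient ascent on an episode reward. Everything specific here is an assumption made for the example and is not taken from the thesis: the toy first-order plant, the squared-error reward, the controller order, the function names, and the use of a deterministic central finite-difference gradient estimate in place of whatever policy gradient estimator the thesis actually uses.

```python
import math


def controller_output(b, a, e_hist, u_hist):
    """Difference equation of U(z)/E(z) = (b0 + b1 z^-1 + ...) / (1 + a1 z^-1 + ...).

    e_hist[0] is the current error and e_hist[i] the error i steps ago;
    u_hist[j] is the controller output j + 1 steps ago.
    """
    feedforward = sum(bi * ei for bi, ei in zip(b, e_hist))
    feedback = sum(aj * uj for aj, uj in zip(a, u_hist))
    return feedforward - feedback


def episode_reward(params, n_b=2, steps=50, setpoint=1.0):
    """Run one episode on a hypothetical plant x[k+1] = 0.9 x[k] + 0.1 u[k].

    params holds the numerator coefficients (first n_b entries) followed by
    the denominator coefficients a1, a2, ... The reward is the negative sum
    of squared tracking errors, so higher is better.
    """
    b, a = list(params[:n_b]), list(params[n_b:])
    e_hist = [0.0] * len(b)
    u_hist = [0.0] * len(a)
    x, reward = 0.0, 0.0
    for _ in range(steps):
        e_hist = [setpoint - x] + e_hist[:-1]
        u = controller_output(b, a, e_hist, u_hist)
        if u_hist:
            u_hist = [u] + u_hist[:-1]
        x = 0.9 * x + 0.1 * u
        reward -= (setpoint - x) ** 2
    return reward


def finite_difference_gradient(params, eps=1e-4):
    """Estimate d(reward)/d(params) with central differences."""
    grad = []
    for i in range(len(params)):
        up, down = list(params), list(params)
        up[i] += eps
        down[i] -= eps
        grad.append((episode_reward(up) - episode_reward(down)) / (2 * eps))
    return grad


def optimise(params, iterations=100, step=0.01):
    """Gradient ascent with fixed-length steps along the normalised gradient."""
    params = list(params)
    for _ in range(iterations):
        grad = finite_difference_gradient(params)
        norm = math.sqrt(sum(g * g for g in grad)) or 1.0
        params = [p + step * g / norm for p, g in zip(params, grad)]
    return params


if __name__ == "__main__":
    initial = [0.5, 0.0, 0.0]  # b0, b1, a1: starts as a pure proportional controller
    learned = optimise(initial)
    print("initial reward:", episode_reward(initial))
    print("learned reward:", episode_reward(learned))
    print("learned coefficients:", learned)
```

Because the coefficients of the numerator and denominator are all free parameters, the search can move between controller structures (for example, from a purely proportional controller towards one with integral-like action) rather than only tuning gains within a fixed structure. The learned coefficients still define an ordinary transfer function, so its poles and zeros can be inspected afterwards.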
