Many Stochastic Optimal Control (SOC) approaches rely on samples, either to estimate the value function or to linearise the underlying system model. However, these approaches typically neglect the fact that the accuracy of a policy update depends on how close the resulting trajectory distribution stays to those samples. The greedy operator enforces no such closeness constraint, and can therefore cause oscillations or even instabilities in the policy updates; such behaviour is likely to degrade the performance of the estimated policy. Taking inspiration from the reinforcement learning community, we relax the greedy operator used in SOC with an information-theoretic bound on how far each policy update may move the trajectory distribution.
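A minimal sketch of the kind of information-theoretic relaxation described above, assuming a REPS-style (Relative Entropy Policy Search) formulation: sampled trajectory returns R_i are reweighted as w_i proportional to exp(R_i / eta), with the temperature eta chosen by minimising the dual g(eta) = eta * epsilon + eta * log mean_i exp(R_i / eta), so that the reweighted distribution stays within KL divergence epsilon of the uniform sample distribution. Everything below (the function name kl_bounded_weights, the choice of epsilon, the Nelder-Mead solver) is an illustrative assumption, not the paper's actual algorithm.

    import numpy as np
    from scipy.optimize import minimize

    def kl_bounded_weights(returns, epsilon=0.5):
        # Soft-greedy update: reweight sampled trajectories so the implied
        # distribution stays within KL divergence `epsilon` of the (uniform)
        # sampling distribution, rather than jumping greedily to the best sample.
        R = np.asarray(returns, dtype=float)
        R = R - R.max()  # weights are shift-invariant; this avoids overflow

        def dual(log_eta):
            # REPS-style dual, minimised over the temperature eta > 0
            # (optimised in log-space so eta stays positive).
            eta = float(np.exp(np.clip(log_eta[0], -10.0, 10.0)))
            return eta * epsilon + eta * np.log(np.mean(np.exp(R / eta)))

        res = minimize(dual, x0=[0.0], method="Nelder-Mead")
        eta = float(np.exp(np.clip(res.x[0], -10.0, 10.0)))
        w = np.exp(R / eta)
        return w / w.sum(), eta

    # Example: weights for a relaxed policy update from 100 sampled returns.
    rng = np.random.default_rng(0)
    weights, eta = kl_bounded_weights(rng.normal(size=100), epsilon=0.5)

A small epsilon forces nearly uniform weights (conservative updates that stay close to the samples), while a large epsilon recovers the greedy operator whose oscillations the abstract warns against.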