Robust Policy Updates for Stochastic Optimal Control

Elmar Rueckert
Max Mindt
Jan Peters
Gerhard Neumann

Publication date

October 2015

Abstract

Abstract — For controlling high-dimensional robots, most stochastic optimal control algorithms use approximations of the system dynamics and of the cost function (e.g., using lin-earizations and Taylor expansions). These approximations are typically only locally correct, which might cause instabilities in the greedy policy updates, lead to oscillations or the algorithms diverge. To overcome these drawbacks, we add a regularization term to the cost function that punishes large policy update steps in the trajectory optimization procedure. We applied this concept to the Approximate Inference Control method (AICO), where the resulting algorithm guarantees convergence for uninformative initial solutions without complex hand-tuning of learning ra...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Robust Policy Updates for Stochastic Optimal Control

Abstract

Extracted data

Robust Policy Updates for Stochastic Optimal Control

Abstract

Extracted data

Related items

Related items