For controlling high-dimensional robots, most stochastic optimal control algorithms use approximations of the system dynamics and of the cost function (e.g., using linearizations and Taylor expansions). These approximations are typically only locally correct, which might cause instabilities in the greedy policy updates, lead to oscillations or the algorithms diverge. To overcome these drawbacks, we add a regularization term to the cost function that punishes large policy update steps in the trajectory optimization procedure. We applied this concept to the Approximate Inference Control method (AICO), where the resulting algorithm guarantees convergence for uninformative initial solutions without complex hand-tuning of learning rates. We eval...
Reinforcement learning (RL) and trajectory optimization (TO) present strong complementary advantages...
A central goal of the robotics community is to develop general optimization algorithms for producing...
Reinforcement learning methods are being applied to control problems in robotics domain. These algor...
Abstract — For controlling high-dimensional robots, most stochastic optimal control algorithms use a...
This thesis studies the problem of designing reliable control laws of robotic systems operating in u...
Abstract—Reinforcement learning and policy search methods can in principle solve a wide range of con...
How does uncertainty affect a robot when attempting to generate a control policy to achieve some obj...
Reinforcement learning (RL) and trajectory optimization (TO) present strong complementary advantages...
Abstract — Slow convergence is a major problem for policy gradient methods. It is a consequence of t...
Optimal Feedback Control (OFC) has been proposed as an attractive movement generation strategy in go...
Decision making under uncertainty is an important problem in engineering that is traditionally appro...
We introduce a novel optimization-based motion planner, Stochastic Extended LQR (SELQR), which compu...
Reinforcement learning (RL) and trajectory optimization (TO) present strong complementary advantages...
Reinforcement learning (RL) and trajectory optimization (TO) present strong complementary advantages...
Reinforcement learning (RL) and trajectory optimization (TO) present strong complementary advantages...
Reinforcement learning (RL) and trajectory optimization (TO) present strong complementary advantages...
A central goal of the robotics community is to develop general optimization algorithms for producing...
Reinforcement learning methods are being applied to control problems in robotics domain. These algor...
Abstract — For controlling high-dimensional robots, most stochastic optimal control algorithms use a...
This thesis studies the problem of designing reliable control laws of robotic systems operating in u...
Abstract—Reinforcement learning and policy search methods can in principle solve a wide range of con...
How does uncertainty affect a robot when attempting to generate a control policy to achieve some obj...
Reinforcement learning (RL) and trajectory optimization (TO) present strong complementary advantages...
Abstract — Slow convergence is a major problem for policy gradient methods. It is a consequence of t...
Optimal Feedback Control (OFC) has been proposed as an attractive movement generation strategy in go...
Decision making under uncertainty is an important problem in engineering that is traditionally appro...
We introduce a novel optimization-based motion planner, Stochastic Extended LQR (SELQR), which compu...
Reinforcement learning (RL) and trajectory optimization (TO) present strong complementary advantages...
Reinforcement learning (RL) and trajectory optimization (TO) present strong complementary advantages...
Reinforcement learning (RL) and trajectory optimization (TO) present strong complementary advantages...
Reinforcement learning (RL) and trajectory optimization (TO) present strong complementary advantages...
A central goal of the robotics community is to develop general optimization algorithms for producing...
Reinforcement learning methods are being applied to control problems in robotics domain. These algor...