Stochastic Optimal Control (SOC) is typically used to plan a movement for a specific situation. While most SOC methods fail to generalize this movement plan to a new situation without re-planning, we present a SOC method that allows us to reuse the obtained policy in a new situation as the policy is more robust to slight de-viations from the initial movement plan. In order to improve the robustness of the policy, we employ information-theoretic policy updates that explicitly operate on trajectory distributions instead of single trajectories. To ensure a stable and smooth policy update, we limit the ‘distance ’ between the trajectory distributions of the old and the new control policy. The introduced bound offers a closed form solution for t...
Stochastic Optimal Control is an elegant and general framework for specifying and solving control pr...
Abstract: Recent work on path integral stochastic optimal control theory Theodorou et al. (2010a); T...
This brief presents a framework for input-optimal navigation under state constraints for vehicles ex...
proaches rely on samples to either obtain an estimate of the value function or a linearisation of th...
Many Stochastic Optimal Control (SOC) approaches rely on samples to either obtain an estimate of th...
Copyright © 2014 IEEEPresented at IEEE Symposium on Adaptive Dynamic Programming and Reinforcement L...
Trajectory-Centric Reinforcement Learning and Trajectory Optimization methods optimize a sequence of...
Abstract — For controlling high-dimensional robots, most stochastic optimal control algorithms use a...
UnrestrictedMotivated by the limitations of current optimal control and reinforcement learning metho...
A framework capable of computing optimal control policies for a continuous system in the presence of...
Abstract—We present a reformulation of the stochastic optimal control problem in terms of KL diverge...
Dynamic programming is a principal method for analyzing stochastic optimal control problems. However...
In this article, we present a generalized view on Path Integral Control (PIC) methods. PIC refers to...
Path integral stochastic optimal control based learning methods are among the most efficient and sca...
Modeling robot motion planning with uncertainty in a Bayesian framework leads to a computationally i...
Stochastic Optimal Control is an elegant and general framework for specifying and solving control pr...
Abstract: Recent work on path integral stochastic optimal control theory Theodorou et al. (2010a); T...
This brief presents a framework for input-optimal navigation under state constraints for vehicles ex...
proaches rely on samples to either obtain an estimate of the value function or a linearisation of th...
Many Stochastic Optimal Control (SOC) approaches rely on samples to either obtain an estimate of th...
Copyright © 2014 IEEEPresented at IEEE Symposium on Adaptive Dynamic Programming and Reinforcement L...
Trajectory-Centric Reinforcement Learning and Trajectory Optimization methods optimize a sequence of...
Abstract — For controlling high-dimensional robots, most stochastic optimal control algorithms use a...
UnrestrictedMotivated by the limitations of current optimal control and reinforcement learning metho...
A framework capable of computing optimal control policies for a continuous system in the presence of...
Abstract—We present a reformulation of the stochastic optimal control problem in terms of KL diverge...
Dynamic programming is a principal method for analyzing stochastic optimal control problems. However...
In this article, we present a generalized view on Path Integral Control (PIC) methods. PIC refers to...
Path integral stochastic optimal control based learning methods are among the most efficient and sca...
Modeling robot motion planning with uncertainty in a Bayesian framework leads to a computationally i...
Stochastic Optimal Control is an elegant and general framework for specifying and solving control pr...
Abstract: Recent work on path integral stochastic optimal control theory Theodorou et al. (2010a); T...
This brief presents a framework for input-optimal navigation under state constraints for vehicles ex...