Abstract — For controlling high-dimensional robots, most stochastic optimal control algorithms use approximations of the system dynamics and of the cost function (e.g., using lin-earizations and Taylor expansions). These approximations are typically only locally correct, which might cause instabilities in the greedy policy updates, lead to oscillations or the algorithms diverge. To overcome these drawbacks, we add a regularization term to the cost function that punishes large policy update steps in the trajectory optimization procedure. We applied this concept to the Approximate Inference Control method (AICO), where the resulting algorithm guarantees convergence for uninformative initial solutions without complex hand-tuning of learning ra...
This thesis is mostly focused on reinforcement learning, which is viewed as an optimization problem:...
Motion planning and control problems are embedded and essential in almost all robotics applications....
Many Stochastic Optimal Control (SOC) approaches rely on samples to either obtain an estimate of th...
For controlling high-dimensional robots, most stochastic optimal control algorithms use approximatio...
proaches rely on samples to either obtain an estimate of the value function or a linearisation of th...
How does uncertainty affect a robot when attempting to generate a control policy to achieve some obj...
Abstract—Reinforcement learning and policy search methods can in principle solve a wide range of con...
Reinforcement learning methods are being applied to control problems in robotics domain. These algor...
Trajectory-Centric Reinforcement Learning and Trajectory Optimization methods optimize a sequence of...
This thesis studies the problem of designing reliable control laws of robotic systems operating in u...
Stochastic Optimal Control (SOC) is typically used to plan a movement for a specific situation. Whil...
Decision making under uncertainty is an important problem in engineering that is traditionally appro...
A central goal of the robotics community is to develop general optimization algorithms for producing...
Abstract Background and problem statement Model-free or learning-based control, in particular, reinf...
UnrestrictedMotivated by the limitations of current optimal control and reinforcement learning metho...
This thesis is mostly focused on reinforcement learning, which is viewed as an optimization problem:...
Motion planning and control problems are embedded and essential in almost all robotics applications....
Many Stochastic Optimal Control (SOC) approaches rely on samples to either obtain an estimate of th...
For controlling high-dimensional robots, most stochastic optimal control algorithms use approximatio...
proaches rely on samples to either obtain an estimate of the value function or a linearisation of th...
How does uncertainty affect a robot when attempting to generate a control policy to achieve some obj...
Abstract—Reinforcement learning and policy search methods can in principle solve a wide range of con...
Reinforcement learning methods are being applied to control problems in robotics domain. These algor...
Trajectory-Centric Reinforcement Learning and Trajectory Optimization methods optimize a sequence of...
This thesis studies the problem of designing reliable control laws of robotic systems operating in u...
Stochastic Optimal Control (SOC) is typically used to plan a movement for a specific situation. Whil...
Decision making under uncertainty is an important problem in engineering that is traditionally appro...
A central goal of the robotics community is to develop general optimization algorithms for producing...
Abstract Background and problem statement Model-free or learning-based control, in particular, reinf...
UnrestrictedMotivated by the limitations of current optimal control and reinforcement learning metho...
This thesis is mostly focused on reinforcement learning, which is viewed as an optimization problem:...
Motion planning and control problems are embedded and essential in almost all robotics applications....
Many Stochastic Optimal Control (SOC) approaches rely on samples to either obtain an estimate of th...