Reinforcement learning (RL) and trajectory optimization (TO) present strong complementary advantages. On one hand, RL approaches are able to learn global control policies directly from data, but generally require large sample sizes to properly converge towards feasible policies. On the other hand, TO methods are able to exploit gradient-based information extracted from simulators to quickly converge towards a locally optimal control trajectory which is only valid within the vicinity of the solution. Over the past decade, several approaches have aimed to adequately combine the two classes of methods in order to obtain the best of both worlds. Following on from this line of research, we propose several improvements on top of these approaches ...
Abstract—Reinforcement learning and policy search methods can in principle solve a wide range of con...
Recent advances in artificial intelligence are producing fascinating results in the field of compute...
Recent advances in machine learning, simulation, algorithm design, and computer hardware have allowe...
Reinforcement learning (RL) and trajectory optimization (TO) present strong complementary advantages...
Many of the recent trajectory optimization algorithms alternate between linear approximation of the...
We consider the problem of optimization in policy space for reinforcement learning. While a plethora...
Gradient-based methods have been widely used for system design and optimization in diverse applicati...
International audienceThe recent successes in deep reinforcement learning largely rely on the capabi...
Many of the recent Trajectory Optimization algorithms alternate between local approximation of the d...
Deep reinforcement learning uses simulators as abstract oracles to interact with the environment. In...
We consider the problem of optimization in policy space for reinforcement learning. While a plethora...
In robotics, elementary behaviour patterns often tackle control theoretic problems. Because of incom...
Robot programmers can often quickly program a robot to approximately execute a task under specific e...
Data-driven approaches to the design of control policies for robotic systems have the potential to r...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
Abstract—Reinforcement learning and policy search methods can in principle solve a wide range of con...
Recent advances in artificial intelligence are producing fascinating results in the field of compute...
Recent advances in machine learning, simulation, algorithm design, and computer hardware have allowe...
Reinforcement learning (RL) and trajectory optimization (TO) present strong complementary advantages...
Many of the recent trajectory optimization algorithms alternate between linear approximation of the...
We consider the problem of optimization in policy space for reinforcement learning. While a plethora...
Gradient-based methods have been widely used for system design and optimization in diverse applicati...
International audienceThe recent successes in deep reinforcement learning largely rely on the capabi...
Many of the recent Trajectory Optimization algorithms alternate between local approximation of the d...
Deep reinforcement learning uses simulators as abstract oracles to interact with the environment. In...
We consider the problem of optimization in policy space for reinforcement learning. While a plethora...
In robotics, elementary behaviour patterns often tackle control theoretic problems. Because of incom...
Robot programmers can often quickly program a robot to approximately execute a task under specific e...
Data-driven approaches to the design of control policies for robotic systems have the potential to r...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
Abstract—Reinforcement learning and policy search methods can in principle solve a wide range of con...
Recent advances in artificial intelligence are producing fascinating results in the field of compute...
Recent advances in machine learning, simulation, algorithm design, and computer hardware have allowe...