Most policy search (PS) algorithms require thousands of training episodes to find an effective policy, which is often infeasible with a physical robot. This survey article focuses on the extreme other end of the spectrum: how can a robot adapt with only a handful of trials (a dozen) and a few minutes? By analogy with the word "big-data", we refer to this challenge as "micro-data reinforcement learning". In this article, we show that a first strategy is to leverage prior knowledge on the policy structure (e.g., dynamic movement primitives), on the policy parameters (e.g., demonstrations), or on the dynamics (e.g., simulators). A second strategy is to create data-driven surrogate models of the expected reward (e.g., Bayesian optimization) or ...
In many complex robot applications, such as grasping and manipulation, it is difficult to program de...
International audiencePolicy improvement methods seek to optimize the parameters of a policy with re...
Abstract—Autonomous learning has been a promising direction in control and robotics for more than a ...
International audienceMost policy search algorithms require thousands of training episodes to find a...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...
International audienceThe most data-efficient algorithms for reinforcement learning in robotics are ...
In robotics, controllers make the robot solve a task within a specific context. The context can des...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2...
Many motor skills in humanoid robotics can be learned using parametrized motor primitives. While suc...
© 2014 Elsevier B.V.In robotics, lower-level controllers are typically used to make the robot solve ...
<p>Reinforcement learning offers to robotics a framework and set of tools for the design of sophisti...
International audienceThe most data-efficient algorithms for reinforcement learning (RL) in robotics...
Data-driven approaches to the design of control policies for robotic systems have the potential to r...
Sample efficiency is one of the key factors when applying policy search to real-world problems. In r...
In many complex robot applications, such as grasping and manipulation, it is difficult to program de...
International audiencePolicy improvement methods seek to optimize the parameters of a policy with re...
Abstract—Autonomous learning has been a promising direction in control and robotics for more than a ...
International audienceMost policy search algorithms require thousands of training episodes to find a...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...
International audienceThe most data-efficient algorithms for reinforcement learning in robotics are ...
In robotics, controllers make the robot solve a task within a specific context. The context can des...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2...
Many motor skills in humanoid robotics can be learned using parametrized motor primitives. While suc...
© 2014 Elsevier B.V.In robotics, lower-level controllers are typically used to make the robot solve ...
<p>Reinforcement learning offers to robotics a framework and set of tools for the design of sophisti...
International audienceThe most data-efficient algorithms for reinforcement learning (RL) in robotics...
Data-driven approaches to the design of control policies for robotic systems have the potential to r...
Sample efficiency is one of the key factors when applying policy search to real-world problems. In r...
In many complex robot applications, such as grasping and manipulation, it is difficult to program de...
International audiencePolicy improvement methods seek to optimize the parameters of a policy with re...
Abstract—Autonomous learning has been a promising direction in control and robotics for more than a ...