Most policy search (PS) algorithms require thousands of training episodes to find an effective policy, which is often infeasible with a physical robot. This survey article focuses on the extreme other end of the spectrum: how can a robot adapt with only a handful of trials (a dozen) and a few minutes? By analogy with the word "big-data", we refer to this challenge as "micro-data reinforcement learning". In this article, we show that a first strategy is to leverage prior knowledge on the policy structure (e.g., dynamic movement primitives), on the policy parameters (e.g., demonstrations), or on the dynamics (e.g., simulators). A second strategy is to create data-driven surrogate models of the expected reward (e.g., Bayesian optimization) or ...
Many motor skills in humanoid robotics can be learned using parametrized motor primitives. While suc...
© 2014 Elsevier B.V.In robotics, lower-level controllers are typically used to make the robot solve ...
Many motor skills in humanoid robotics can be learned using parametrized motor primitives. While suc...
International audienceMost policy search algorithms require thousands of training episodes to find a...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...
International audienceMost policy search (PS) algorithms require thousands of training episodes to f...
International audienceThe most data-efficient algorithms for reinforcement learning in robotics are ...
In robotics, controllers make the robot solve a task within a specific context. The context can des...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
Many motor skills in humanoid robotics can be learned using parametrized motor primitives. While suc...
Many motor skills in humanoid robotics can be learned using parametrized motor primitives. While suc...
© 2014 Elsevier B.V.In robotics, lower-level controllers are typically used to make the robot solve ...
Many motor skills in humanoid robotics can be learned using parametrized motor primitives. While suc...
International audienceMost policy search algorithms require thousands of training episodes to find a...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...
International audienceMost policy search (PS) algorithms require thousands of training episodes to f...
International audienceThe most data-efficient algorithms for reinforcement learning in robotics are ...
In robotics, controllers make the robot solve a task within a specific context. The context can des...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
Many motor skills in humanoid robotics can be learned using parametrized motor primitives. While suc...
Many motor skills in humanoid robotics can be learned using parametrized motor primitives. While suc...
© 2014 Elsevier B.V.In robotics, lower-level controllers are typically used to make the robot solve ...
Many motor skills in humanoid robotics can be learned using parametrized motor primitives. While suc...