International audienceMany machine learning approaches in robotics, based on re- inforcement learning, inverse optimal control or direct policy learning, critically rely on robot simulators. This paper investigates a simulator- free direct policy learning, called Preference-based Policy Learning (PPL). PPL iterates a four-step process: the robot demonstrates a candidate pol- icy; the expert ranks this policy comparatively to other ones according to her preferences; these preferences are used to learn a policy return estimate; the robot uses the policy return estimate to build new can- didate policies, and the process is iterated until the desired behavior is obtained. PPL requires a good representation of the policy search space be availabl...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
This paper proposes a high-level reinforcement learning (RL) control system for solving the action s...
The acquisition and improvement of motor skills and control policies for robotics from trial and err...
International audienceMany machine learning approaches in robotics, based on re- inforcement learnin...
Abstract. Many machine learning approaches in robotics, based on re-inforcement learning, inverse op...
This work tackles in-situ robotics: the goal is to learn a policy while the robot operates in the re...
International audienceAimed at on-board robot training, an approach hybridizing active preference le...
Policy Learning approaches are among the best suited methods for high-dimensional, continuous contro...
Policy search is a subfield in reinforcement learning which focuses on finding good parameters for ...
In robotics, controllers make the robot solve a task within a specific context. The context can des...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2...
A summary of the state-of-the-art reinforcement learning in robotics is given, in terms of both algo...
International audienceIn real-world robotic applications, many factors, both at low-level (e.g., vis...
Most policy search (PS) algorithms require thousands of training episodes to find an effective polic...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
This paper proposes a high-level reinforcement learning (RL) control system for solving the action s...
The acquisition and improvement of motor skills and control policies for robotics from trial and err...
International audienceMany machine learning approaches in robotics, based on re- inforcement learnin...
Abstract. Many machine learning approaches in robotics, based on re-inforcement learning, inverse op...
This work tackles in-situ robotics: the goal is to learn a policy while the robot operates in the re...
International audienceAimed at on-board robot training, an approach hybridizing active preference le...
Policy Learning approaches are among the best suited methods for high-dimensional, continuous contro...
Policy search is a subfield in reinforcement learning which focuses on finding good parameters for ...
In robotics, controllers make the robot solve a task within a specific context. The context can des...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2...
A summary of the state-of-the-art reinforcement learning in robotics is given, in terms of both algo...
International audienceIn real-world robotic applications, many factors, both at low-level (e.g., vis...
Most policy search (PS) algorithms require thousands of training episodes to find an effective polic...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
This paper proposes a high-level reinforcement learning (RL) control system for solving the action s...
The acquisition and improvement of motor skills and control policies for robotics from trial and err...