Robots are often highly non-linear dynamical systems with many degrees of freedom, making solving motion problems computationally challenging. One solution has been reinforcement learning (RL), which learns through experimentation to automatically perform the near-optimal motions that complete a task. However, high-dimensional problems and task formulation often prove challenging for RL. We address these problems with PrEference Appraisal Reinforcement Learning (PEARL), which solves Preference Balancing Tasks (PBTs). PBTs define a problem as a set of preferences that the system must balance to achieve a goal. The method is appropriate for acceleration-controlled systems with continuous state-space and either discrete or continuous action sp...
Humans often demonstrate diverse behaviors due to their personal preferences, for instance related t...
Over the course of the last decade, the framework of reinforcement learning has developed into a pro...
Specifying a numeric reward function for reinforcement learning typically requires a lot of hand-tun...
Robots are often highly non-linear dynamical systems with many degrees of freedom, making solving mo...
Reinforcement learning (RL) techniques optimize the accumulated long-term reward of a suitably chose...
International audienceThis paper focuses on reinforcement learning (RL) with limited prior knowledge...
We consider the problem of preference based reinforcement learning (PbRL), where, unlike traditional...
Abstract. This paper focuses on reinforcement learning (RL) with lim-ited prior knowledge. In the do...
Abstract. This paper focuses on reinforcement learning (RL) with lim-ited prior knowledge. In the do...
Reinforcement Learning for Robotics is a trending area of research with tremendous potential for wi...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2...
As most action generation problems of autonomous robots can be phrased in terms of sequential decisi...
This work tackles in-situ robotics: the goal is to learn a policy while the robot operates in the re...
The current reward learning from human preferences could be used to resolve complex reinforcement le...
International audienceAimed at on-board robot training, an approach hybridizing active preference le...
Humans often demonstrate diverse behaviors due to their personal preferences, for instance related t...
Over the course of the last decade, the framework of reinforcement learning has developed into a pro...
Specifying a numeric reward function for reinforcement learning typically requires a lot of hand-tun...
Robots are often highly non-linear dynamical systems with many degrees of freedom, making solving mo...
Reinforcement learning (RL) techniques optimize the accumulated long-term reward of a suitably chose...
International audienceThis paper focuses on reinforcement learning (RL) with limited prior knowledge...
We consider the problem of preference based reinforcement learning (PbRL), where, unlike traditional...
Abstract. This paper focuses on reinforcement learning (RL) with lim-ited prior knowledge. In the do...
Abstract. This paper focuses on reinforcement learning (RL) with lim-ited prior knowledge. In the do...
Reinforcement Learning for Robotics is a trending area of research with tremendous potential for wi...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2...
As most action generation problems of autonomous robots can be phrased in terms of sequential decisi...
This work tackles in-situ robotics: the goal is to learn a policy while the robot operates in the re...
The current reward learning from human preferences could be used to resolve complex reinforcement le...
International audienceAimed at on-board robot training, an approach hybridizing active preference le...
Humans often demonstrate diverse behaviors due to their personal preferences, for instance related t...
Over the course of the last decade, the framework of reinforcement learning has developed into a pro...
Specifying a numeric reward function for reinforcement learning typically requires a lot of hand-tun...