While reinforcement learning has led to promising results in robotics, defining an informative reward function can sometimes prove to be challenging. Prior work considered including the human in the loop to jointly learn the reward function and the optimal policy. Generating samples from a physical robot and requesting human feedback are both taxing efforts for which efficiency is critical. In contrast to prior work, in this paper we propose to learn reward functions from both the robot and the human perspectives in order to improve on both efficiency metrics. On one side, learning a reward function from the human perspective increases feedback efficiency by assuming that humans rank trajectories according to an outcome space of reduced dim...
Deploying learning systems in the real-world requires aligning their objectives with those of the hu...
Recent successes combine reinforcement learning algorithms and deep neural networks, despite reinfor...
Mastering robotic manipulation skills through reinforcement learning (RL) typically requires the des...
University of Technology Sydney. Faculty of Engineering and Information Technology.A promising metho...
Humans can leverage physical interaction to teach robot arms. This physical interaction takes multip...
AbstractWhile Reinforcement Learning (RL) is not traditionally designed for interactive supervisory ...
As robots become a mass consumer product, they will need to learn new skills by interacting with typ...
The current reward learning from human preferences could be used to resolve complex reinforcement le...
Robots are on the verge of becoming ubiquitous. In the form of affordable humanoid toy robots, auton...
Off-the-shelf Reinforcement Learning (RL) algorithms suffer from slow learning performance, partly b...
Practical implementations of deep reinforcement learning (deep RL) have been challenging due to an a...
The task of learning a reward function from expert demonstrations suffers from high sample complexit...
The utility of reinforcement learning is limited by the alignment of reward functions with the inter...
International audienceObserving a human demonstrator manipulate objects provides a rich, scalable an...
Applying Deep Reinforcement Learning (DRL) to Human-Robot Cooperation (HRC) in dynamic control probl...
Deploying learning systems in the real-world requires aligning their objectives with those of the hu...
Recent successes combine reinforcement learning algorithms and deep neural networks, despite reinfor...
Mastering robotic manipulation skills through reinforcement learning (RL) typically requires the des...
University of Technology Sydney. Faculty of Engineering and Information Technology.A promising metho...
Humans can leverage physical interaction to teach robot arms. This physical interaction takes multip...
AbstractWhile Reinforcement Learning (RL) is not traditionally designed for interactive supervisory ...
As robots become a mass consumer product, they will need to learn new skills by interacting with typ...
The current reward learning from human preferences could be used to resolve complex reinforcement le...
Robots are on the verge of becoming ubiquitous. In the form of affordable humanoid toy robots, auton...
Off-the-shelf Reinforcement Learning (RL) algorithms suffer from slow learning performance, partly b...
Practical implementations of deep reinforcement learning (deep RL) have been challenging due to an a...
The task of learning a reward function from expert demonstrations suffers from high sample complexit...
The utility of reinforcement learning is limited by the alignment of reward functions with the inter...
International audienceObserving a human demonstrator manipulate objects provides a rich, scalable an...
Applying Deep Reinforcement Learning (DRL) to Human-Robot Cooperation (HRC) in dynamic control probl...
Deploying learning systems in the real-world requires aligning their objectives with those of the hu...
Recent successes combine reinforcement learning algorithms and deep neural networks, despite reinfor...
Mastering robotic manipulation skills through reinforcement learning (RL) typically requires the des...