Abstract. This paper focuses on reinforcement learning (RL) with lim-ited prior knowledge. In the domain of swarm robotics for instance, the expert can hardly design a reward function or demonstrate the target behavior, forbidding the use of both standard RL and inverse reinforce-ment learning. Although with a limited expertise, the human expert is still often able to emit preferences and rank the agent demonstrations. Earlier work has presented an iterative preference-based RL framework: expert preferences are exploited to learn an approximate policy return, thus enabling the agent to achieve direct policy search. Iteratively, the agent selects a new candidate policy and demonstrates it; the expert ranks the new demonstration comparatively...
Specifying a numeric reward function for reinforcement learning typically requires a lot of hand-tun...
The current reward learning from human preferences could be used to resolve complex reinforcement le...
Abstract. Many machine learning approaches in robotics, based on re-inforcement learning, inverse op...
Abstract. This paper focuses on reinforcement learning (RL) with lim-ited prior knowledge. In the do...
International audienceThis paper focuses on reinforcement learning (RL) with limited prior knowledge...
This work tackles in-situ robotics: the goal is to learn a policy while the robot operates in the re...
This paper makes a first step toward the integration of two subfields of machine learning, namely pr...
This paper makes a first step toward the integration of two subfields of machine learning, namely pr...
Reinforcement learning (RL) techniques optimize the accumulated long-term reward of a suitably chose...
Robots are often highly non-linear dynamical systems with many degrees of freedom, making solving mo...
Our goal is to efficiently learn reward functions encoding a human's preferences for how a dynamical...
Conveying complex objectives to reinforcement learning (RL) agents often requires meticulous reward ...
As most action generation problems of autonomous robots can be phrased in terms of sequential decisi...
New flexible teaching methods for robotics are needed to automate repetitive tasks that are currentl...
The field of deep reinforcement learning has seen major successes recently, achieving superhuman per...
Specifying a numeric reward function for reinforcement learning typically requires a lot of hand-tun...
The current reward learning from human preferences could be used to resolve complex reinforcement le...
Abstract. Many machine learning approaches in robotics, based on re-inforcement learning, inverse op...
Abstract. This paper focuses on reinforcement learning (RL) with lim-ited prior knowledge. In the do...
International audienceThis paper focuses on reinforcement learning (RL) with limited prior knowledge...
This work tackles in-situ robotics: the goal is to learn a policy while the robot operates in the re...
This paper makes a first step toward the integration of two subfields of machine learning, namely pr...
This paper makes a first step toward the integration of two subfields of machine learning, namely pr...
Reinforcement learning (RL) techniques optimize the accumulated long-term reward of a suitably chose...
Robots are often highly non-linear dynamical systems with many degrees of freedom, making solving mo...
Our goal is to efficiently learn reward functions encoding a human's preferences for how a dynamical...
Conveying complex objectives to reinforcement learning (RL) agents often requires meticulous reward ...
As most action generation problems of autonomous robots can be phrased in terms of sequential decisi...
New flexible teaching methods for robotics are needed to automate repetitive tasks that are currentl...
The field of deep reinforcement learning has seen major successes recently, achieving superhuman per...
Specifying a numeric reward function for reinforcement learning typically requires a lot of hand-tun...
The current reward learning from human preferences could be used to resolve complex reinforcement le...
Abstract. Many machine learning approaches in robotics, based on re-inforcement learning, inverse op...