International audienceLearning by imitation, among the most promising techniques for reinforcement learning in complex domains, critically depends on the human designer ability to provide sufficiently many demonstrations of satisfactory quality. The approach presented in this paper, referred to as DIVA (Direct Value Learning for Reinforcement Learning), aims at addressing both above limitations by exploiting simple experiments. The approach stems from a straightforward remark: while it is rather easy to set a robot in a target situation, the quality of its situation will naturally deteriorate upon the action of naive controllers. The demonstration of such naive controllers can thus be used to learn directly a value function, through a prefe...
This paper makes a first step toward the integration of two subfields of machine learning, namely pr...
The goal of task transfer in reinforcement learning is migrating the action policy of an agent to th...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
International audienceLearning by imitation, among the most promising techniques for reinforcement l...
International audienceTaking inspiration from inverse reinforcement learning, the proposed Direct Va...
New flexible teaching methods for robotics are needed to automate repetitive tasks that are currentl...
<p>Reinforcement learning offers to robotics a framework and set of tools for the design of sophisti...
Robot acquiring skill is a process similar to human skill learning. Reinforcement learning (RL) is a...
AbstractThe impulsive preference of an animal for an immediate reward implies that it might subjecti...
The value function, at the core of the Bellmanian Reinforcement Learning framework, associates to ea...
The current reward learning from human preferences could be used to resolve complex reinforcement le...
Some imitation learning approaches rely on Inverse Reinforcement Learning (IRL) methods, to decode a...
Humans can leverage physical interaction to teach robot arms. This physical interaction takes multip...
Following the principle of human skill learning, robot acquiring skill is a process similar to human...
This work tackles in-situ robotics: the goal is to learn a policy while the robot operates in the re...
This paper makes a first step toward the integration of two subfields of machine learning, namely pr...
The goal of task transfer in reinforcement learning is migrating the action policy of an agent to th...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
International audienceLearning by imitation, among the most promising techniques for reinforcement l...
International audienceTaking inspiration from inverse reinforcement learning, the proposed Direct Va...
New flexible teaching methods for robotics are needed to automate repetitive tasks that are currentl...
<p>Reinforcement learning offers to robotics a framework and set of tools for the design of sophisti...
Robot acquiring skill is a process similar to human skill learning. Reinforcement learning (RL) is a...
AbstractThe impulsive preference of an animal for an immediate reward implies that it might subjecti...
The value function, at the core of the Bellmanian Reinforcement Learning framework, associates to ea...
The current reward learning from human preferences could be used to resolve complex reinforcement le...
Some imitation learning approaches rely on Inverse Reinforcement Learning (IRL) methods, to decode a...
Humans can leverage physical interaction to teach robot arms. This physical interaction takes multip...
Following the principle of human skill learning, robot acquiring skill is a process similar to human...
This work tackles in-situ robotics: the goal is to learn a policy while the robot operates in the re...
This paper makes a first step toward the integration of two subfields of machine learning, namely pr...
The goal of task transfer in reinforcement learning is migrating the action policy of an agent to th...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...