In this paper we study the question of life long learning of behaviors from human demonstrations by an intelligent system. One approach is to model the observed demonstrations by a stationary policy. Inverse rein-forcement learning, on the other hand, searches a reward function that makes the observed policy closed to optimal in the corresponding Markov decision process. This approach provides a model of the task solved by the demonstrator and has been shown to lead to better generalization in un-known contexts. However both approaches focus on learning a single task from the expert demonstration. In this paper we propose a feature learn-ing approach for inverse reinforcement learning in which several different tasks are demonstrated, but i...
Based on the premise that the most succinct representation of the behavior of an entity is its rewar...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
Reinforcement Learning (RL) is an effective approach to solve sequential decision making problems wh...
The goal of inverse reinforcement learning is to find a reward function for a Markov decision proces...
Learning desirable behavior from a limited number of demonstrations, also known as inverse reinforce...
We consider learning in a Markov decision process where we are not explicitly given a reward functio...
In traditional Reinforcement Learning (RL) [4], a single agent learns to act in an environment by op...
In decision-making problems reward function plays an important role in finding the best policy. Rein...
This purpose of this paper is to provide an overview of the theoretical background and applications ...
Inverse reinforcement learning (IRL) aims at estimating an unknown reward function optimized by some...
International audienceThis paper addresses the Inverse Reinforcement Learning (IRL) problem which is...
We study the problem of learning a policy in a Markov decision process (MDP) based on observations o...
A major challenge faced by machine learning community is the decision making problems under uncertai...
We generalise the problem of inverse reinforcement learning to multiple tasks, from multiple demonst...
Inverse Reinforcement Learning (IRL) deals with the problem of recovering the reward function optimi...
Based on the premise that the most succinct representation of the behavior of an entity is its rewar...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
Reinforcement Learning (RL) is an effective approach to solve sequential decision making problems wh...
The goal of inverse reinforcement learning is to find a reward function for a Markov decision proces...
Learning desirable behavior from a limited number of demonstrations, also known as inverse reinforce...
We consider learning in a Markov decision process where we are not explicitly given a reward functio...
In traditional Reinforcement Learning (RL) [4], a single agent learns to act in an environment by op...
In decision-making problems reward function plays an important role in finding the best policy. Rein...
This purpose of this paper is to provide an overview of the theoretical background and applications ...
Inverse reinforcement learning (IRL) aims at estimating an unknown reward function optimized by some...
International audienceThis paper addresses the Inverse Reinforcement Learning (IRL) problem which is...
We study the problem of learning a policy in a Markov decision process (MDP) based on observations o...
A major challenge faced by machine learning community is the decision making problems under uncertai...
We generalise the problem of inverse reinforcement learning to multiple tasks, from multiple demonst...
Inverse Reinforcement Learning (IRL) deals with the problem of recovering the reward function optimi...
Based on the premise that the most succinct representation of the behavior of an entity is its rewar...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
Reinforcement Learning (RL) is an effective approach to solve sequential decision making problems wh...