Abstract. Inverse reinforcement learning (IRL) addresses the problem of recovering a task description given a demonstration of the optimal policy used to solve such a task. The optimal policy is usually provided by an expert or teacher, making IRL specially suitable for the problem of apprenticeship learning. The task description is encoded in the form of a reward function of a Markov decision process (MDP). Several algo-rithms have been proposed to find the reward function corresponding to a set of demonstrations. One of the algorithms that has provided best results in different applications is a gradient method to optimize a policy squared error criterion. On a parallel line of research, other authors have presented recently a gradient ap...
A major challenge faced by machine learning community is the decision making problems under uncertai...
This work handles the inverse reinforcement learning (IRL) problem where only a small number of demo...
We provide new perspectives and inference algorithms for Maximum Entropy (MaxEnt) Inverse Reinforcem...
Inverse Reinforcement Learning (IRL) deals with the problem of recovering the reward function optimi...
Learning desirable behavior from a limited number of demonstrations, also known as inverse reinforce...
We consider learning in a Markov decision process where we are not explicitly given a reward functio...
The problem of learning an expert’s unknown reward function using a limited number of demonstrations...
Reinforcement Learning (RL) is an effective approach to solve sequential decision making problems wh...
In decision-making problems reward function plays an important role in finding the best policy. Rein...
We propose a novel formulation for the Inverse Reinforcement Learning (IRL) problem, which jointly a...
Reinforcement Learning (RL) methods provide a solution for decision-making problems under uncertaint...
Various methods for solving the inverse reinforcement learning (IRL) problem have been developed ind...
This purpose of this paper is to provide an overview of the theoretical background and applications ...
We study the problem of learning a policy in a Markov decision process (MDP) based on observations o...
One of the fundamental problems of artificial intelligence is learning how to behave optimally. With...
A major challenge faced by machine learning community is the decision making problems under uncertai...
This work handles the inverse reinforcement learning (IRL) problem where only a small number of demo...
We provide new perspectives and inference algorithms for Maximum Entropy (MaxEnt) Inverse Reinforcem...
Inverse Reinforcement Learning (IRL) deals with the problem of recovering the reward function optimi...
Learning desirable behavior from a limited number of demonstrations, also known as inverse reinforce...
We consider learning in a Markov decision process where we are not explicitly given a reward functio...
The problem of learning an expert’s unknown reward function using a limited number of demonstrations...
Reinforcement Learning (RL) is an effective approach to solve sequential decision making problems wh...
In decision-making problems reward function plays an important role in finding the best policy. Rein...
We propose a novel formulation for the Inverse Reinforcement Learning (IRL) problem, which jointly a...
Reinforcement Learning (RL) methods provide a solution for decision-making problems under uncertaint...
Various methods for solving the inverse reinforcement learning (IRL) problem have been developed ind...
This purpose of this paper is to provide an overview of the theoretical background and applications ...
We study the problem of learning a policy in a Markov decision process (MDP) based on observations o...
One of the fundamental problems of artificial intelligence is learning how to behave optimally. With...
A major challenge faced by machine learning community is the decision making problems under uncertai...
This work handles the inverse reinforcement learning (IRL) problem where only a small number of demo...
We provide new perspectives and inference algorithms for Maximum Entropy (MaxEnt) Inverse Reinforcem...