Abstract. We propose a graph-based algorithm for apprenticeship learning when the reward features are noisy. Previous apprenticeship learning techniques learn a reward function by using only local state features. This can be a limitation in practice, as often some features are misspecified or subject to measurement noise. Our graphical framework, inspired from the work on Markov Random Fields, allows to alleviate this problem by propagating information between states, and rewarding policies that choose similar actions in adjacent states. We demonstrate the advantage of the proposed approach on grid-world navigation problems, and on the problem of teaching a robot to grasp novel objects in simulation.
Considering that expert's demonstrations are usually sub optimal and failed demonstrations ofte...
We consider the problem of apprenticeship learning where the examples, demon-strated by an expert, c...
We provide new theoretical results for apprenticeship learning, a variant of rein-forcement learning...
We consider the problem of apprenticeship learning where the examples, demonstrated by an expert, co...
We consider the problem of apprenticeship learning where the examples, demonstrated by an expert, co...
This paper develops a generalized appren-ticeship learning protocol for reinforcement-learning agent...
We consider the problem of apprenticeship learning when the expert's demonstration covers only a sma...
We consider learning in a Markov decision process where we are not explicitly given a reward functio...
As the field of robotic and humanoid systems expand, more research is being done on how to best cont...
International audienceThis paper deals with the problem of learning from demonstrations, where an ag...
In Apprenticeship Learning (AL), we are given a Markov Decision Process (MDP) without access to the ...
International audienceThis paper addresses the problem of apprenticeship learning, that is learning ...
International audienceThis paper addresses the problem of apprenticeship learning, that is learning ...
lille1.fr This paper deals with the problem of learning from demon-strations, where an agent called ...
We study the problem of an apprentice learning to behave in an environment with an unknown reward fu...
Considering that expert's demonstrations are usually sub optimal and failed demonstrations ofte...
We consider the problem of apprenticeship learning where the examples, demon-strated by an expert, c...
We provide new theoretical results for apprenticeship learning, a variant of rein-forcement learning...
We consider the problem of apprenticeship learning where the examples, demonstrated by an expert, co...
We consider the problem of apprenticeship learning where the examples, demonstrated by an expert, co...
This paper develops a generalized appren-ticeship learning protocol for reinforcement-learning agent...
We consider the problem of apprenticeship learning when the expert's demonstration covers only a sma...
We consider learning in a Markov decision process where we are not explicitly given a reward functio...
As the field of robotic and humanoid systems expand, more research is being done on how to best cont...
International audienceThis paper deals with the problem of learning from demonstrations, where an ag...
In Apprenticeship Learning (AL), we are given a Markov Decision Process (MDP) without access to the ...
International audienceThis paper addresses the problem of apprenticeship learning, that is learning ...
International audienceThis paper addresses the problem of apprenticeship learning, that is learning ...
lille1.fr This paper deals with the problem of learning from demon-strations, where an agent called ...
We study the problem of an apprentice learning to behave in an environment with an unknown reward fu...
Considering that expert's demonstrations are usually sub optimal and failed demonstrations ofte...
We consider the problem of apprenticeship learning where the examples, demon-strated by an expert, c...
We provide new theoretical results for apprenticeship learning, a variant of rein-forcement learning...