Inverse reinforcement learning (1RL) aims to recover the reward function underlying a Markov Decision Process from behaviors of experts in support of decision-making. Most recent work on IRL assumes the same level of trustworthiness of all expert behaviors, and frames IRL as a process of seeking reward function that makes those behaviors appear (near)-optimal. However, it is common in reality that noisy expert behaviors disobeying the optimal policy exist, which may degrade the IRL performance significantly. To address this issue, in this paper, we develop a robust IRL framework that can accurately estimate the reward function in the presence of behavior noise. In particular, we focus on a special type of behavior noise referred to as spars...
Based on the premise that the most succinct representation of the behavior of an entity is its rewar...
Being able to infer the goals, preferences and limitations of humans is of key importance in designi...
We state the problem of inverse reinforcement learning in terms of preference elicitation, resulting...
Inverse reinforcement learning (IRL) aims to recover the reward function underlying a Markov Decisio...
Inverse Reinforcement Learning (IRL) aims to recover a reward function from expert demonstrations in...
Existing inverse reinforcement learning (IRL) algorithms have assumed each expert’s demonstrated tra...
International audienceInverse Reinforcement Learning (IRL) is an effective approach to recover a rew...
The aim of Inverse Reinforcement Learning (IRL) is to infer a reward function R from a policy pi. To...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
Learning desirable behavior from a limited number of demonstrations, also known as inverse reinforce...
Existing inverse reinforcement learning (IRL) algorithms have assumed each ex-pert’s demonstrated tr...
Inverse reinforcement learning~(IRL) is a powerful framework to infer an agent's reward function by ...
In decision-making problems reward function plays an important role in finding the best policy. Rein...
The problem of learning an expert’s unknown reward function using a limited number of demonstrations...
In the field of reinforcement learning there has been recent progress towards safety and high-confid...
Based on the premise that the most succinct representation of the behavior of an entity is its rewar...
Being able to infer the goals, preferences and limitations of humans is of key importance in designi...
We state the problem of inverse reinforcement learning in terms of preference elicitation, resulting...
Inverse reinforcement learning (IRL) aims to recover the reward function underlying a Markov Decisio...
Inverse Reinforcement Learning (IRL) aims to recover a reward function from expert demonstrations in...
Existing inverse reinforcement learning (IRL) algorithms have assumed each expert’s demonstrated tra...
International audienceInverse Reinforcement Learning (IRL) is an effective approach to recover a rew...
The aim of Inverse Reinforcement Learning (IRL) is to infer a reward function R from a policy pi. To...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
Learning desirable behavior from a limited number of demonstrations, also known as inverse reinforce...
Existing inverse reinforcement learning (IRL) algorithms have assumed each ex-pert’s demonstrated tr...
Inverse reinforcement learning~(IRL) is a powerful framework to infer an agent's reward function by ...
In decision-making problems reward function plays an important role in finding the best policy. Rein...
The problem of learning an expert’s unknown reward function using a limited number of demonstrations...
In the field of reinforcement learning there has been recent progress towards safety and high-confid...
Based on the premise that the most succinct representation of the behavior of an entity is its rewar...
Being able to infer the goals, preferences and limitations of humans is of key importance in designi...
We state the problem of inverse reinforcement learning in terms of preference elicitation, resulting...