Abstract—Inverse Reinforcement Learning (IRL) is an ap-proach for domain-reward discovery from demonstration, where an agent mines the reward function of a Markov decision process by observing an expert acting in the domain. In the standard setting, it is assumed that the expert acts (nearly) optimally, and a large number of trajectories, i.e., training examples are avail-able for reward discovery (and consequently, learning domain behavior). These are not practical assumptions: trajectories are often noisy, and there can be a paucity of examples. Our novel approach incorporates advice-giving into the IRL framework to address these issues. Inspired by preference elicitation, a domain expert provides advice on states and actions (features) b...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
Reinforcement learning has become a widely used methodology for creating intelligent agents in a wid...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
Intelligent systems that interact with humans typically require demonstrations and/or advice from th...
We study a class of reinforcement learning tasks in which the agent receives its reward for complex,...
In decision-making problems reward function plays an important role in finding the best policy. Rein...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
A major challenge faced by machine learning community is the decision making problems under uncertai...
A major challenge faced by machine learning community is the decision making problems under uncertai...
An important issue in reinforcement learning is how to incorporate expert knowledge in a principled ...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
Reinforcement learning has become a widely used methodology for creating intelligent agents in a wid...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
Intelligent systems that interact with humans typically require demonstrations and/or advice from th...
We study a class of reinforcement learning tasks in which the agent receives its reward for complex,...
In decision-making problems reward function plays an important role in finding the best policy. Rein...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
A major challenge faced by machine learning community is the decision making problems under uncertai...
A major challenge faced by machine learning community is the decision making problems under uncertai...
An important issue in reinforcement learning is how to incorporate expert knowledge in a principled ...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...
Reinforcement learning has become a widely used methodology for creating intelligent agents in a wid...
In real-world applications, inferring the intentions of expert agents (e.g., human operators) can be...