One of the ways to make reinforcement learning (RL) more ef- ficient is by utilizing human advice. Because human advice is expensive, the central question in advice-based reinforcement learning is, how to decide in which states the agent should ask for advice. To approach this challenge, various advice strate- gies have been proposed. Although all of these strategies dis- tribute advice more efficiently than naive strategies (such as choosing random states), they rely solely on the agent’s inter- nal representation of the task (the action-value function, the policy, etc.) and therefore, are rather inefficient when this rep- resentation is not accurate, in particular, in the early stages of the learning process. To address this weakness, we ...
Intelligent systems that interact with humans typically require demonstrations and/or advice from th...
We present a novel formulation for providing ad-vice to a reinforcement learner that employs support...
We consider the problem of incorporating end-user advice into re-inforcement learning (RL). In our s...
One of the ways to make reinforcement learning (RL) more efficient is by utilizing human advice. Bec...
An important issue in reinforcement learning is how to incorporate expert knowledge in a principled ...
Although Reinforcement Learning (RL) has been one of the most successful approaches for learning in ...
Although Reinforcement Learning (RL) has been one of the most successful approaches for learning in ...
In this article, we study the transfer learning model of action advice under a budget. We focus on r...
We consider sequential decision making problems under uncertainty, in which a user has a general ide...
International audienceWe consider sequential decision making problems under uncertainty , in which a...
International audienceWe consider sequential decision making problems under uncertainty , in which a...
Learning from reinforcements is a promising approach for creating intelligent agents. However, reinf...
We study a class of reinforcement learning tasks in which the agent receives its reward for complex,...
Abstract—Inverse Reinforcement Learning (IRL) is an ap-proach for domain-reward discovery from demon...
We consider the problem of incorporating end-user advice into reinforcement learning (RL). In our se...
Intelligent systems that interact with humans typically require demonstrations and/or advice from th...
We present a novel formulation for providing ad-vice to a reinforcement learner that employs support...
We consider the problem of incorporating end-user advice into re-inforcement learning (RL). In our s...
One of the ways to make reinforcement learning (RL) more efficient is by utilizing human advice. Bec...
An important issue in reinforcement learning is how to incorporate expert knowledge in a principled ...
Although Reinforcement Learning (RL) has been one of the most successful approaches for learning in ...
Although Reinforcement Learning (RL) has been one of the most successful approaches for learning in ...
In this article, we study the transfer learning model of action advice under a budget. We focus on r...
We consider sequential decision making problems under uncertainty, in which a user has a general ide...
International audienceWe consider sequential decision making problems under uncertainty , in which a...
International audienceWe consider sequential decision making problems under uncertainty , in which a...
Learning from reinforcements is a promising approach for creating intelligent agents. However, reinf...
We study a class of reinforcement learning tasks in which the agent receives its reward for complex,...
Abstract—Inverse Reinforcement Learning (IRL) is an ap-proach for domain-reward discovery from demon...
We consider the problem of incorporating end-user advice into reinforcement learning (RL). In our se...
Intelligent systems that interact with humans typically require demonstrations and/or advice from th...
We present a novel formulation for providing ad-vice to a reinforcement learner that employs support...
We consider the problem of incorporating end-user advice into re-inforcement learning (RL). In our s...