In this article, we study the transfer learning model of action advice under a budget. We focus on reinforcement learning teachers providing action advice to heterogeneous students playing the game of Pac-Man under a limited advice budget. First, we examine several critical factors affecting advice quality in this setting, such as the average performance of the teacher, its variance and the importance of reward discounting in advising. The experiments show that the best performers are not always the best teachers and reveal the non-trivial importance of the coefficient of variation (CV) as a statistic for choosing policies that generate advice. The CV statistic relates variance to the corresponding mean. Second, the article studies policy l...
Agents learning how to act in new environments can benefit from input from more experienced agents o...
Intelligent systems that interact with humans typically require demonstrations and/or advice from th...
Abstract. In some reinforcement learning problems an agent may be provided with a set of input polic...
One of the ways to make reinforcement learning (RL) more efficient is by utilizing human advice. Bec...
Learning from reinforcements is a promising approach for creating intelligent agents. However, reinf...
Action advising is a peer-to-peer knowledge exchange technique built on the teacher-student paradigm...
One of the ways to make reinforcement learning (RL) more ef- ficient is by utilizing human advice. B...
Reinforcement Learning has long been employed to solve sequential decision-making problems with mini...
An important issue in reinforcement learning is how to incorporate expert knowledge in a principled ...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
National audienceHumans' impressive learning abilities are partly due to their capacity to reuse inf...
National audienceHumans' impressive learning abilities are partly due to their capacity to reuse inf...
We study a class of reinforcement learning tasks in which the agent receives its reward for complex,...
The problem we consider in this paper is reinforcement learning with value advice. In this setting, ...
We consider the problem of incorporating end-user advice into reinforcement learning (RL). In our se...
Agents learning how to act in new environments can benefit from input from more experienced agents o...
Intelligent systems that interact with humans typically require demonstrations and/or advice from th...
Abstract. In some reinforcement learning problems an agent may be provided with a set of input polic...
One of the ways to make reinforcement learning (RL) more efficient is by utilizing human advice. Bec...
Learning from reinforcements is a promising approach for creating intelligent agents. However, reinf...
Action advising is a peer-to-peer knowledge exchange technique built on the teacher-student paradigm...
One of the ways to make reinforcement learning (RL) more ef- ficient is by utilizing human advice. B...
Reinforcement Learning has long been employed to solve sequential decision-making problems with mini...
An important issue in reinforcement learning is how to incorporate expert knowledge in a principled ...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
National audienceHumans' impressive learning abilities are partly due to their capacity to reuse inf...
National audienceHumans' impressive learning abilities are partly due to their capacity to reuse inf...
We study a class of reinforcement learning tasks in which the agent receives its reward for complex,...
The problem we consider in this paper is reinforcement learning with value advice. In this setting, ...
We consider the problem of incorporating end-user advice into reinforcement learning (RL). In our se...
Agents learning how to act in new environments can benefit from input from more experienced agents o...
Intelligent systems that interact with humans typically require demonstrations and/or advice from th...
Abstract. In some reinforcement learning problems an agent may be provided with a set of input polic...