The application of decision making and learning algorithms to multi-agent systems presents many interestingresearch challenges and opportunities. Among these is the ability for agents to learn how to act by observing or imitating other agents. We describe an algorithm, the IQ-algorithm, that integrates imitation with Q-learning. Roughly, a Q-learner uses the observations it has made of an "expert" agent to bias its exploration in promising directions. This algorithm goes beyond previous work in this direction by relaxing the oft-made assumptions that the learner (observer) and the expert (observed agent) share the same objectives and abilities. Our preliminary experiments demonstrate significant transfer between agents using the I...
Abstract. In a persistent multi-agent system, it should be possible for new agents to benefit from t...
Being able to accomplish tasks with multiple learners through learning has long been a goal of the m...
On-line learning methods have been applied successfully in multi-agent systems to achieve coordinati...
Imitation can be viewed as a means of enhancing learning in multiagent environments. It augments an...
Imitation is an example of social learning in which an individual observes and copies another's acti...
Imitation is actively being studied as an effective means of learning in multi-agent environments. I...
Abstract. Reinforcement learning techniques are increasingly being used to solve dicult problems in ...
A common problem in Reinforcement Learning (RL) is that the reward function is hard to express. This...
Imitation can be viewed as a means of enhancing learning in multiagent environments. It augments an ...
The promise of imitation is to facilitate learning by allowing the learner to ob-serve a teacher in ...
Abstract. The number of proposed reinforcement learning algorithms appears to be ever-growing. This ...
Context. Developing an Artificial Intelligence (AI) agent that canpredict and act in all possible si...
Most research in reinforcement learning has focused on stationary environments. In this paper, we pr...
Most research in reinforcement learning has focused on stationary environments. In this paper, we pr...
Mutual learning is an emerging field in intelligent systems which takes inspiration from naturally i...
Abstract. In a persistent multi-agent system, it should be possible for new agents to benefit from t...
Being able to accomplish tasks with multiple learners through learning has long been a goal of the m...
On-line learning methods have been applied successfully in multi-agent systems to achieve coordinati...
Imitation can be viewed as a means of enhancing learning in multiagent environments. It augments an...
Imitation is an example of social learning in which an individual observes and copies another's acti...
Imitation is actively being studied as an effective means of learning in multi-agent environments. I...
Abstract. Reinforcement learning techniques are increasingly being used to solve dicult problems in ...
A common problem in Reinforcement Learning (RL) is that the reward function is hard to express. This...
Imitation can be viewed as a means of enhancing learning in multiagent environments. It augments an ...
The promise of imitation is to facilitate learning by allowing the learner to ob-serve a teacher in ...
Abstract. The number of proposed reinforcement learning algorithms appears to be ever-growing. This ...
Context. Developing an Artificial Intelligence (AI) agent that canpredict and act in all possible si...
Most research in reinforcement learning has focused on stationary environments. In this paper, we pr...
Most research in reinforcement learning has focused on stationary environments. In this paper, we pr...
Mutual learning is an emerging field in intelligent systems which takes inspiration from naturally i...
Abstract. In a persistent multi-agent system, it should be possible for new agents to benefit from t...
Being able to accomplish tasks with multiple learners through learning has long been a goal of the m...
On-line learning methods have been applied successfully in multi-agent systems to achieve coordinati...