This paper investigates the problem of policy learning in multiagent environments using the stochastic game framework, which we briefly overview. We introduce two properties as desirable for a learning agent when in the presence of other learning agents, namely rationality and convergence. We examine existing reinforcement learning algorithms according to these two properties and notice that they fail to simultaneously meet both criteria. We then contribute a new learning algorithm, WoLF policy hillclimbing, that is based on a simple principle: “learn quickly while losing, slowly while winning. ” The algorithm is proven to be rational and we present empirical results for a number of stochastic games showing the algorithm converges.
Although multi-agent reinforcement learning can tackle systems of strategically interacting entities...
Hirsch [2], is called smooth fictitious play. Using techniques from stochastic approximation by the ...
This article investigates the performance of independent reinforcement learners in multi-agent games...
This paper investigates the problem of policy learn-ing in multiagent environments using the stochas...
AbstractLearning to act in a multiagent environment is a difficult problem since the normal definiti...
Stochastic games are a general model of interaction between multiple agents. They have recently been...
This paper provides a genera1 framework to analyze rational learning in strategic situations where t...
Two multi-agent policy iteration learning algorithms are proposed in this work. The two proposed alg...
Learning behaviors in a multiagent environment is crucial for developing and adapting multiagent sys...
In this paper, we address the problem of convergence to Nash equilibria in games with rewards that a...
In this paper, we address multi-agent decision problems where all agents share a common goal. This c...
Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. Thi...
Inspired by the recent results in policy gradient learning in a general-sum game scenario, in the fo...
Agent competition and coordination are two classical and most important tasks in multiagent systems....
39 pages, 6 figures, 1 tableWe develop a unified stochastic approximation framework for analyzing th...
Although multi-agent reinforcement learning can tackle systems of strategically interacting entities...
Hirsch [2], is called smooth fictitious play. Using techniques from stochastic approximation by the ...
This article investigates the performance of independent reinforcement learners in multi-agent games...
This paper investigates the problem of policy learn-ing in multiagent environments using the stochas...
AbstractLearning to act in a multiagent environment is a difficult problem since the normal definiti...
Stochastic games are a general model of interaction between multiple agents. They have recently been...
This paper provides a genera1 framework to analyze rational learning in strategic situations where t...
Two multi-agent policy iteration learning algorithms are proposed in this work. The two proposed alg...
Learning behaviors in a multiagent environment is crucial for developing and adapting multiagent sys...
In this paper, we address the problem of convergence to Nash equilibria in games with rewards that a...
In this paper, we address multi-agent decision problems where all agents share a common goal. This c...
Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. Thi...
Inspired by the recent results in policy gradient learning in a general-sum game scenario, in the fo...
Agent competition and coordination are two classical and most important tasks in multiagent systems....
39 pages, 6 figures, 1 tableWe develop a unified stochastic approximation framework for analyzing th...
Although multi-agent reinforcement learning can tackle systems of strategically interacting entities...
Hirsch [2], is called smooth fictitious play. Using techniques from stochastic approximation by the ...
This article investigates the performance of independent reinforcement learners in multi-agent games...