We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-agent games. We make the observation that in a competitive setting with adaptive agents an agent's actions will (likely) result in changes in the opponents policies. In addition to accounting for the estimated policies of the opponents, our algorithm also adjusts these future opponent policies by incorporating estimates of how the opponents change their policy as a reaction to ones own actions. We present results showing that agents that learn with this algorithm can successfully achieve high reward in competitive multi-agent games where myopic self-interested behavior conflicts with the long term individual interests of the players.We show t...
We present a novel and uniform formulation of the problem of reinforcement learning against bounded ...
Recently, the social dilemma problem is no longer limited to unrealistic stateless matrix games but ...
Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents ’ ...
We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-...
Reinforcement learning can provide a robust and natural means for agents to learn how to coordinate ...
Evolution of cooperation and competition can appear when multiple adaptive agents share a biological...
Evolution of cooperation and competition can appear when multiple adaptive agents share a biological...
Being able to accomplish tasks with multiple learners through learning has long been a goal of the m...
This article investigates the performance of independent reinforcement learners in multi-agent games...
Multi-agent learning is a growing area of research. An important topic is to formulate how an agent ...
In this paper, we address multi-agent decision problems where all agents share a common goal. This c...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...
Dynamic noncooperative multiagent systems are systems where self-interested agents interact with eac...
We propose a method for learning multi-agent policies to compete against multiple opponents. The met...
We present a novel and uniform formulation of the problem of reinforcement learning against bounded ...
Recently, the social dilemma problem is no longer limited to unrealistic stateless matrix games but ...
Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents ’ ...
We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-...
Reinforcement learning can provide a robust and natural means for agents to learn how to coordinate ...
Evolution of cooperation and competition can appear when multiple adaptive agents share a biological...
Evolution of cooperation and competition can appear when multiple adaptive agents share a biological...
Being able to accomplish tasks with multiple learners through learning has long been a goal of the m...
This article investigates the performance of independent reinforcement learners in multi-agent games...
Multi-agent learning is a growing area of research. An important topic is to formulate how an agent ...
In this paper, we address multi-agent decision problems where all agents share a common goal. This c...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...
Dynamic noncooperative multiagent systems are systems where self-interested agents interact with eac...
We propose a method for learning multi-agent policies to compete against multiple opponents. The met...
We present a novel and uniform formulation of the problem of reinforcement learning against bounded ...
Recently, the social dilemma problem is no longer limited to unrealistic stateless matrix games but ...
Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents ’ ...