We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-agent games. We make the observation that in a competitive setting with adaptive agents an agent's actions will (likely) result in changes in the opponents policies. In addition to accounting for the estimated policies of the opponents, our algorithm also adjusts these future opponent policies by incorporating estimates of how the opponents change their policy as a reaction to ones own actions. We present results showing that agents that learn with this algorithm can successfully achieve high reward in competitive multi-agent games where myopic self-interested behavior conflicts with the long term individual interests of the players.We show t...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...
Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. Thi...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...
We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-...
Reinforcement learning can provide a robust and natural means for agents to learn how to coordinate ...
Multi-agent learning is a growing area of research. An important topic is to formulate how an agent ...
This article investigates the performance of independent reinforcement learners in multi-agent games...
Evolution of cooperation and competition can appear when multiple adaptive agents share a biological...
Evolution of cooperation and competition can appear when multiple adaptive agents share a biological...
In this paper, we address multi-agent decision problems where all agents share a common goal. This c...
The goal of a self-interested agent within a multi-agent system is to maximize its utility over time...
Being able to accomplish tasks with multiple learners through learning has long been a goal of the m...
Dynamic noncooperative multiagent systems are systems where self-interested agents interact with eac...
We propose a method for learning multi-agent policies to compete against multiple opponents. The met...
Some game theory approaches to solve multiagent reinforce-ment learning in self play, i.e. when agen...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...
Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. Thi...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...
We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-...
Reinforcement learning can provide a robust and natural means for agents to learn how to coordinate ...
Multi-agent learning is a growing area of research. An important topic is to formulate how an agent ...
This article investigates the performance of independent reinforcement learners in multi-agent games...
Evolution of cooperation and competition can appear when multiple adaptive agents share a biological...
Evolution of cooperation and competition can appear when multiple adaptive agents share a biological...
In this paper, we address multi-agent decision problems where all agents share a common goal. This c...
The goal of a self-interested agent within a multi-agent system is to maximize its utility over time...
Being able to accomplish tasks with multiple learners through learning has long been a goal of the m...
Dynamic noncooperative multiagent systems are systems where self-interested agents interact with eac...
We propose a method for learning multi-agent policies to compete against multiple opponents. The met...
Some game theory approaches to solve multiagent reinforce-ment learning in self play, i.e. when agen...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...
Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. Thi...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...