This article investigates the performance of independent reinforcement learners in multi-agent games. Convergence to Nash equilibria and parameter settings for desired learning be-havior are discussed for Q-learning, Frequency Maximum Q value (FMQ) learning and lenient Q-learning. FMQ and lenient Q-learning are shown to outperform regular Q-learning significantly in the context of coordination games with mis-coordination penalties. Furthermore, Q-learning with an -greedy and FMQ learning with a Boltzmann action selection are shown to scale well to games with one thousand agents
Abstract. In this paper we compare state-of-the-art multi-agent rein-forcement learning algorithms i...
Machine learning and artificial intelligence has been a hot topic the last few years, thanks to impr...
Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents ’ ...
Reinforcement learning can provide a robust and natural means for agents to learn how to coordinate ...
In this paper we compare state-of-the-art multi-agent reinforcement learning algorithms in a wide va...
Multi-agent learning plays an increasingly important role in solving complex dynamic problems in to-...
The single-agent multi-armed bandit problem can be solved by an agent that learns the values of each...
In this paper, we address multi-agent decision problems where all agents share a common goal. This c...
We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-...
Being able to accomplish tasks with multiple learners through learning has long been a goal of the m...
Although well understood in the single-agent framework, the use of traditional reinforcement learnin...
In this paper, we address the problem of convergence to Nash equilibria in games with rewards that a...
Repeated play in games by simple adaptive agents is investigated. The agents use Q-learning, a speci...
This paper introduces a new multi-agent learning algorithm for stochastic games based on replicator ...
Two multi-agent policy iteration learning algorithms are proposed in this work. The two proposed alg...
Abstract. In this paper we compare state-of-the-art multi-agent rein-forcement learning algorithms i...
Machine learning and artificial intelligence has been a hot topic the last few years, thanks to impr...
Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents ’ ...
Reinforcement learning can provide a robust and natural means for agents to learn how to coordinate ...
In this paper we compare state-of-the-art multi-agent reinforcement learning algorithms in a wide va...
Multi-agent learning plays an increasingly important role in solving complex dynamic problems in to-...
The single-agent multi-armed bandit problem can be solved by an agent that learns the values of each...
In this paper, we address multi-agent decision problems where all agents share a common goal. This c...
We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-...
Being able to accomplish tasks with multiple learners through learning has long been a goal of the m...
Although well understood in the single-agent framework, the use of traditional reinforcement learnin...
In this paper, we address the problem of convergence to Nash equilibria in games with rewards that a...
Repeated play in games by simple adaptive agents is investigated. The agents use Q-learning, a speci...
This paper introduces a new multi-agent learning algorithm for stochastic games based on replicator ...
Two multi-agent policy iteration learning algorithms are proposed in this work. The two proposed alg...
Abstract. In this paper we compare state-of-the-art multi-agent rein-forcement learning algorithms i...
Machine learning and artificial intelligence has been a hot topic the last few years, thanks to impr...
Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents ’ ...