Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based learning algorithm, augmenting the basic gradient ascent approach with policy prediction. We prove that this augmentation results in a stronger notion of convergence than the basic gradient ascent, that is, strategies converge to a Nash equilibrium within a restricted class of iterated games. Motivated by this augmentation, we then propose a new practical multi-agent reinforcement learning (MARL) algorithm exploiting approximate policy prediction. Empirical results show that it converges faster and in a wider variety of situations than state-of-the-art MARL algorithms
Shoham et al. [1] identify several important agendas which can help direct research in multi-agent l...
This paper investigates the problem of policy learn-ing in multiagent environments using the stochas...
Abstract. In this paper we compare state-of-the-art multi-agent rein-forcement learning algorithms i...
Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents ’ ...
Two multi-agent policy iteration learning algorithms are proposed in this work. The two proposed alg...
The policy gradient method is a popular technique for implementing reinforcement learning in an agen...
The policy gradient method is a popular technique for implementing reinforcement learning in an agen...
The policy gradient method is a popular technique for implementing reinforcement learning in an agen...
We propose a method for learning multi-agent policies to compete against multiple opponents. The met...
In this paper we compare state-of-the-art multi-agent reinforcement learning algorithms in a wide va...
We consider model-based multi-agent reinforcement learning, where the environment transition model i...
Inspired by the recent results in policy gradient learning in a general-sum game scenario, in the fo...
Abstract. The number of proposed reinforcement learning algorithms appears to be ever-growing. This ...
Although multi-agent reinforcement learning can tackle systems of strategically interacting entities...
This paper investigates the problem of policy learning in multiagent environments using the stochast...
Shoham et al. [1] identify several important agendas which can help direct research in multi-agent l...
This paper investigates the problem of policy learn-ing in multiagent environments using the stochas...
Abstract. In this paper we compare state-of-the-art multi-agent rein-forcement learning algorithms i...
Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents ’ ...
Two multi-agent policy iteration learning algorithms are proposed in this work. The two proposed alg...
The policy gradient method is a popular technique for implementing reinforcement learning in an agen...
The policy gradient method is a popular technique for implementing reinforcement learning in an agen...
The policy gradient method is a popular technique for implementing reinforcement learning in an agen...
We propose a method for learning multi-agent policies to compete against multiple opponents. The met...
In this paper we compare state-of-the-art multi-agent reinforcement learning algorithms in a wide va...
We consider model-based multi-agent reinforcement learning, where the environment transition model i...
Inspired by the recent results in policy gradient learning in a general-sum game scenario, in the fo...
Abstract. The number of proposed reinforcement learning algorithms appears to be ever-growing. This ...
Although multi-agent reinforcement learning can tackle systems of strategically interacting entities...
This paper investigates the problem of policy learning in multiagent environments using the stochast...
Shoham et al. [1] identify several important agendas which can help direct research in multi-agent l...
This paper investigates the problem of policy learn-ing in multiagent environments using the stochas...
Abstract. In this paper we compare state-of-the-art multi-agent rein-forcement learning algorithms i...