We study the problem of training a principal in a multi-agent general-sum game using reinforcement learning (RL). Learning a robust principal policy requires anticipating the worst possible strategic responses of other agents, which is generally NP-hard. However, we show that no-regret dynamics can identify these worst-case responses in poly-time in smooth games. We propose a framework that uses this policy evaluation method for efficiently learning a robust principal policy using RL. This framework can be extended to provide robustness to boundedly rational agents too. Our motivating application is automated mechanism design: we empirically demonstrate our framework learns robust mechanisms in both matrix games and complex spatiotemporal g...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...
We present a novel and uniform formulation of the problem of reinforcement learning against bounded ...
In multi-agent systems, intelligent agents are tasked with making decisions that have optimal outcom...
Robust decision-making in multiplayer games requires anticipating what reactions a player policy may...
While various multi-agent reinforcement learning methods have been proposed in cooperative settings,...
34 pages, 6 figuresInternational audienceWe investigate a class of reinforcement learning dynamics i...
Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents ’ ...
We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-...
With the recent advances in solving large, zero-sum extensive form games, there is a growing interes...
Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. Thi...
To achieve general intelligence, agents must learn how to interact with others in a shared environme...
Dynamic zero-sum games are a model of multiagent decision-making that has been well-studied in the m...
Recent models of learning in games have attempted to produce individual-level learning algorithms th...
We show that adversarial reinforcement learning (ARL) can be used to produce market marking agents t...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...
We present a novel and uniform formulation of the problem of reinforcement learning against bounded ...
In multi-agent systems, intelligent agents are tasked with making decisions that have optimal outcom...
Robust decision-making in multiplayer games requires anticipating what reactions a player policy may...
While various multi-agent reinforcement learning methods have been proposed in cooperative settings,...
34 pages, 6 figuresInternational audienceWe investigate a class of reinforcement learning dynamics i...
Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents ’ ...
We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-...
With the recent advances in solving large, zero-sum extensive form games, there is a growing interes...
Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. Thi...
To achieve general intelligence, agents must learn how to interact with others in a shared environme...
Dynamic zero-sum games are a model of multiagent decision-making that has been well-studied in the m...
Recent models of learning in games have attempted to produce individual-level learning algorithms th...
We show that adversarial reinforcement learning (ARL) can be used to produce market marking agents t...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...
We present a novel and uniform formulation of the problem of reinforcement learning against bounded ...
In multi-agent systems, intelligent agents are tasked with making decisions that have optimal outcom...