Policy Space Response Oracle methods (PSRO) provide a general solution to learn Nash equilibrium in two-player zero-sum games but suffer from two drawbacks: (1) the computation inefficiency due to the need for consistent meta-game evaluation via simulations, and (2) the exploration inefficiency due to finding the best response against a fixed meta-strategy at every epoch. In this work, we propose Efficient PSRO (EPSRO) that largely improves the efficiency of the above two steps. Central to our development is the newly-introduced subroutine of no-regret optimization on the unrestricted-restricted (URR) game. By solving URR at each epoch, one can evaluate the current game and compute the best response in one forward pass without the need for ...
We develop an efficient algorithm for computing pure strat-egy Nash equilibria that satisfy various ...
We define a class of zero-sum games with combinatorial structure, where the best response problem of...
We develop an efficient algorithm for computing pure strategy Nash equilibria that satisfy various c...
Solving Nash equilibrium is the key challenge in normal-form games with large strategy spaces, where...
Solving strategic games with huge action spaces is a critical yet under-explored topic in economics...
Developing scalable solution algorithms is one of the central problems in computational game theory....
International audienceIn this paper, we characterize the revision sets in different variants of the ...
In this paper, we introduce a two-player zero-sum framework between a trainable \emph{Solver} and a ...
Recent advances in multiagent learning have seen the introduction ofa family of algorithms that revo...
Hamilton-Jacobi-Isaacs (HJI) PDEs are the governing equations for the two-player general-sum games. ...
This article discusses two contributions to decision-making in complex partially observable stochast...
The quest for a PTAS for Nash equilibrium in a two-player game seeks to circumvent the PPAD-complete...
Regarding the approximation of Nash equilibria in games where the players have a continuum of strate...
We consider the capacitated selfish replication game with binary preferences. We first devise an alg...
In two-player non-cooperative games whose strategy sets are Hilbert spaces, in order to approach Nas...
We develop an efficient algorithm for computing pure strat-egy Nash equilibria that satisfy various ...
We define a class of zero-sum games with combinatorial structure, where the best response problem of...
We develop an efficient algorithm for computing pure strategy Nash equilibria that satisfy various c...
Solving Nash equilibrium is the key challenge in normal-form games with large strategy spaces, where...
Solving strategic games with huge action spaces is a critical yet under-explored topic in economics...
Developing scalable solution algorithms is one of the central problems in computational game theory....
International audienceIn this paper, we characterize the revision sets in different variants of the ...
In this paper, we introduce a two-player zero-sum framework between a trainable \emph{Solver} and a ...
Recent advances in multiagent learning have seen the introduction ofa family of algorithms that revo...
Hamilton-Jacobi-Isaacs (HJI) PDEs are the governing equations for the two-player general-sum games. ...
This article discusses two contributions to decision-making in complex partially observable stochast...
The quest for a PTAS for Nash equilibrium in a two-player game seeks to circumvent the PPAD-complete...
Regarding the approximation of Nash equilibria in games where the players have a continuum of strate...
We consider the capacitated selfish replication game with binary preferences. We first devise an alg...
In two-player non-cooperative games whose strategy sets are Hilbert spaces, in order to approach Nas...
We develop an efficient algorithm for computing pure strat-egy Nash equilibria that satisfy various ...
We define a class of zero-sum games with combinatorial structure, where the best response problem of...
We develop an efficient algorithm for computing pure strategy Nash equilibria that satisfy various c...