We introduce a new algorithm for the numerical computation of Nash equilibria of competitive two-player games. Our method is a natural generalization of gradient descent to the two-player setting where the update is given by the Nash equilibrium of a regularized bilinear local approximation of the underlying game. It avoids oscillatory and divergent behaviors seen in alternating gradient descent. Using numerical experiments and rigorous analysis, we provide a detailed comparison to methods based on optimism and consensus and show that our method avoids making any unnecessary changes to the gradient dynamics while achieving exponential (local) convergence for (locally) convex-concave zero sum games. Convergence and stability properties of ou...
In this paper we present an implementation and performance evaluation of a descent algorithm that wa...
Data-driven model training is increasingly relying on finding Nash equilibria with provable techniqu...
This paper examines the convergence of a broad class of distributed learning dynamics for games with...
We introduce a new algorithm for the numerical computation of Nash equilibria of competitive two-pla...
We study the problem of finding the Nash equilibrium in a two-player zero-sum Markov game. Due to it...
A core challenge in policy optimization in competitive Markov decision processes is the design of ef...
We study the problem of convergence to a stationary point in zero-sum games. We propose competitive ...
Algorithms designed for single-agent reinforcement learning (RL) generally fail to converge to equil...
Constrained competitive optimization involves multiple agents trying to minimize conflicting objecti...
Game theory and online optimization have a close relationship with each other. In some literature, o...
43 pages, 2 tablesInternational audienceLearning in stochastic games is a notoriously difficult prob...
Nash equilibrium is a central concept in game theory. Several Nash solvers exist, yet none scale to ...
summary:This paper proposes a distributed accelerated first-order continuous-time algorithm for $O({...
In this paper, we present a novel consensus-based zeroth-order algorithm tailored for non-convex mul...
We show that, for any sufficiently small fixed $\epsilon > 0$, when both players in a general-sum tw...
In this paper we present an implementation and performance evaluation of a descent algorithm that wa...
Data-driven model training is increasingly relying on finding Nash equilibria with provable techniqu...
This paper examines the convergence of a broad class of distributed learning dynamics for games with...
We introduce a new algorithm for the numerical computation of Nash equilibria of competitive two-pla...
We study the problem of finding the Nash equilibrium in a two-player zero-sum Markov game. Due to it...
A core challenge in policy optimization in competitive Markov decision processes is the design of ef...
We study the problem of convergence to a stationary point in zero-sum games. We propose competitive ...
Algorithms designed for single-agent reinforcement learning (RL) generally fail to converge to equil...
Constrained competitive optimization involves multiple agents trying to minimize conflicting objecti...
Game theory and online optimization have a close relationship with each other. In some literature, o...
43 pages, 2 tablesInternational audienceLearning in stochastic games is a notoriously difficult prob...
Nash equilibrium is a central concept in game theory. Several Nash solvers exist, yet none scale to ...
summary:This paper proposes a distributed accelerated first-order continuous-time algorithm for $O({...
In this paper, we present a novel consensus-based zeroth-order algorithm tailored for non-convex mul...
We show that, for any sufficiently small fixed $\epsilon > 0$, when both players in a general-sum tw...
In this paper we present an implementation and performance evaluation of a descent algorithm that wa...
Data-driven model training is increasingly relying on finding Nash equilibria with provable techniqu...
This paper examines the convergence of a broad class of distributed learning dynamics for games with...