We show that, for any sufficiently small fixed $\epsilon > 0$, when both players in a general-sum two-player (bimatrix) game employ optimistic mirror descent (OMD) with smooth regularization, learning rate $\eta = O(\epsilon^2)$ and $T = \Omega(\text{poly}(1/\epsilon))$ repetitions, either the dynamics reach an $\epsilon$-approximate Nash equilibrium (NE), or the average correlated distribution of play is an $\Omega(\text{poly}(\epsilon))$-strong coarse correlated equilibrium (CCE): any possible unilateral deviation does not only leave the player worse, but will decrease its utility by $\Omega(\text{poly}(\epsilon))$. As an immediate consequence, when the iterates of OMD are bounded away from being Nash equilibria in a bimatrix game, we gua...
International audienceIn this paper, we examine the convergence rate of a wide range of regularized ...
This paper considers no-regret learning for repeated continuous-kernel games with lossy bandit feedb...
The existence of simple, uncoupled no-regret dynamics that converge to correlated equilibria in norm...
We show that Optimistic Hedge -- a common variant of multiplicative-weights-updates with recency bia...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
Recently, Daskalakis, Fishelson, and Golowich (DFG) (NeurIPS`21) showed that if all agents in a mult...
We consider online learning in multi-player smooth monotone games. Existing algorithms have limitati...
International audienceUnderstanding the behavior of no-regret dynamics in general N-player games is ...
In this paper, we propose a second-order extension of the continuous-time game-theoretic mirror desc...
International audienceIn this paper, we examine the equilibrium tracking and convergence properties ...
International audienceThis paper examines the problem of multi-agent learning in N-person non-cooper...
Last lecture we proved that coarse correlated equilibria (CCE) are tractable, in a satisfy-ing sense...
This paper explores a fundamental connection between computational learning theory and game theory t...
This paper examines the long-run behavior of learning with bandit feedback in non-cooperative concav...
We now have an impressive list of tractability results — polynomial-time algorithms and quickly conv...
International audienceIn this paper, we examine the convergence rate of a wide range of regularized ...
This paper considers no-regret learning for repeated continuous-kernel games with lossy bandit feedb...
The existence of simple, uncoupled no-regret dynamics that converge to correlated equilibria in norm...
We show that Optimistic Hedge -- a common variant of multiplicative-weights-updates with recency bia...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
Recently, Daskalakis, Fishelson, and Golowich (DFG) (NeurIPS`21) showed that if all agents in a mult...
We consider online learning in multi-player smooth monotone games. Existing algorithms have limitati...
International audienceUnderstanding the behavior of no-regret dynamics in general N-player games is ...
In this paper, we propose a second-order extension of the continuous-time game-theoretic mirror desc...
International audienceIn this paper, we examine the equilibrium tracking and convergence properties ...
International audienceThis paper examines the problem of multi-agent learning in N-person non-cooper...
Last lecture we proved that coarse correlated equilibria (CCE) are tractable, in a satisfy-ing sense...
This paper explores a fundamental connection between computational learning theory and game theory t...
This paper examines the long-run behavior of learning with bandit feedback in non-cooperative concav...
We now have an impressive list of tractability results — polynomial-time algorithms and quickly conv...
International audienceIn this paper, we examine the convergence rate of a wide range of regularized ...
This paper considers no-regret learning for repeated continuous-kernel games with lossy bandit feedb...
The existence of simple, uncoupled no-regret dynamics that converge to correlated equilibria in norm...