We consider online learning in multi-player smooth monotone games. Existing algorithms have limitations such as (1) being only applicable to strongly monotone games; (2) lacking the no-regret guarantee; (3) having only asymptotic or slow $O(\frac{1}{\sqrt{T}})$ last-iterate convergence rate to a Nash equilibrium. While the $O(\frac{1}{\sqrt{T}})$ rate is tight for a large class of algorithms including the well-studied extragradient algorithm and optimistic gradient algorithm, it is not optimal for all gradient-based algorithms. We propose the accelerated optimistic gradient (AOG) algorithm, the first doubly optimal no-regret learning algorithm for smooth monotone games. Namely, our algorithm achieves both (i) the optimal $O(\sqrt{T})$ reg...
International audienceIn this paper, we consider multi-agent learning via online gradient descent in...
International audienceUnderstanding the behavior of no-regret dynamics in general N-player games is ...
We develop an algorithmic framework for solving convex optimization problems using no-regret game dy...
Online gradient descent (OGD) is well known to be doubly optimal under strong convexity or monotonic...
This paper considers no-regret learning for repeated continuous-kernel games with lossy bandit feedb...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
We show that Optimistic Hedge -- a common variant of multiplicative-weights-updates with recency bia...
Recently, Daskalakis, Fishelson, and Golowich (DFG) (NeurIPS`21) showed that if all agents in a mult...
We show that, for any sufficiently small fixed $\epsilon > 0$, when both players in a general-sum tw...
We examine the problem of regret minimization when the learner is involved in a continuous game with...
International audienceIn this paper, we examine the equilibrium tracking and convergence properties ...
International audienceThis paper examines the problem of multi-agent learning in N-person non-cooper...
This paper examines the long-run behavior of learning with bandit feedback in non-cooperative concav...
AbstractNo-regret is described as one framework that game theorists and computer scientists have con...
In this paper, we provide a novel and simple algorithm, Clairvoyant Multiplicative Weights Updates (...
International audienceIn this paper, we consider multi-agent learning via online gradient descent in...
International audienceUnderstanding the behavior of no-regret dynamics in general N-player games is ...
We develop an algorithmic framework for solving convex optimization problems using no-regret game dy...
Online gradient descent (OGD) is well known to be doubly optimal under strong convexity or monotonic...
This paper considers no-regret learning for repeated continuous-kernel games with lossy bandit feedb...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
We show that Optimistic Hedge -- a common variant of multiplicative-weights-updates with recency bia...
Recently, Daskalakis, Fishelson, and Golowich (DFG) (NeurIPS`21) showed that if all agents in a mult...
We show that, for any sufficiently small fixed $\epsilon > 0$, when both players in a general-sum tw...
We examine the problem of regret minimization when the learner is involved in a continuous game with...
International audienceIn this paper, we examine the equilibrium tracking and convergence properties ...
International audienceThis paper examines the problem of multi-agent learning in N-person non-cooper...
This paper examines the long-run behavior of learning with bandit feedback in non-cooperative concav...
AbstractNo-regret is described as one framework that game theorists and computer scientists have con...
In this paper, we provide a novel and simple algorithm, Clairvoyant Multiplicative Weights Updates (...
International audienceIn this paper, we consider multi-agent learning via online gradient descent in...
International audienceUnderstanding the behavior of no-regret dynamics in general N-player games is ...
We develop an algorithmic framework for solving convex optimization problems using no-regret game dy...