In this paper, we examine the equilibrium tracking and convergence properties of no-regret learning algorithms in continuous games that evolve over time. Specifically, we focus on learning via "mirror descent", a widely used class of no-regret learning schemes where players take small steps along their individual payoff gradients and then "mirror" the output back to their action sets. In this general context, we show that the induced sequence of play stays asymptotically close to the evolving equilibrium of the sequence of stage games (assuming they are strongly monotone), and converges to it if the game stabilizes to a strictly monotone limit. Our results apply to both gradient- and payoff-based feedback, i.e., the "bandit" case where play...
International audienceThis paper examines the equilibrium convergence properties of no-regret learni...
International audienceThis paper examines the equilibrium convergence properties of no-regret learni...
38 pagesIn this paper, we examine the long-term behavior of regret-minimizing agents in time-varying...
In this paper, we examine the equilibrium tracking and convergence properties of no-regret learning ...
International audienceIn this paper, we examine the equilibrium tracking and convergence properties ...
In this paper, we examine the equilibrium tracking and convergence properties of no-regret learning ...
In this paper, we examine the equilibrium tracking and convergence properties of no-regret learning ...
International audienceIn this paper, we examine the equilibrium tracking and convergence properties ...
International audienceIn this paper, we examine the equilibrium tracking and convergence properties ...
34 pagesInternational audienceWe examine the long-run behavior of multi-agent online learning in gam...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
International audienceThis paper examines the equilibrium convergence properties of no-regret learni...
International audienceThis paper examines the equilibrium convergence properties of no-regret learni...
International audienceThis paper examines the equilibrium convergence properties of no-regret learni...
International audienceThis paper examines the equilibrium convergence properties of no-regret learni...
International audienceThis paper examines the equilibrium convergence properties of no-regret learni...
International audienceThis paper examines the equilibrium convergence properties of no-regret learni...
38 pagesIn this paper, we examine the long-term behavior of regret-minimizing agents in time-varying...
In this paper, we examine the equilibrium tracking and convergence properties of no-regret learning ...
International audienceIn this paper, we examine the equilibrium tracking and convergence properties ...
In this paper, we examine the equilibrium tracking and convergence properties of no-regret learning ...
In this paper, we examine the equilibrium tracking and convergence properties of no-regret learning ...
International audienceIn this paper, we examine the equilibrium tracking and convergence properties ...
International audienceIn this paper, we examine the equilibrium tracking and convergence properties ...
34 pagesInternational audienceWe examine the long-run behavior of multi-agent online learning in gam...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
International audienceThis paper examines the equilibrium convergence properties of no-regret learni...
International audienceThis paper examines the equilibrium convergence properties of no-regret learni...
International audienceThis paper examines the equilibrium convergence properties of no-regret learni...
International audienceThis paper examines the equilibrium convergence properties of no-regret learni...
International audienceThis paper examines the equilibrium convergence properties of no-regret learni...
International audienceThis paper examines the equilibrium convergence properties of no-regret learni...
38 pagesIn this paper, we examine the long-term behavior of regret-minimizing agents in time-varying...