Learning in general-sum games is unstable and frequently leads to socially undesirable (Pareto-dominated) outcomes. To mitigate this, Learning with Opponent-Learning Awareness (LOLA) introduced opponent shaping to this setting, by accounting for each agent's influence on their opponents' anticipated learning steps. However, the original LOLA formulation (and follow-up work) is inconsistent because LOLA models other agents as naive learners rather than LOLA agents. In previous work, this inconsistency was suggested as a cause of LOLA's failure to preserve stable fixed points (SFPs). First, we formalize consistency and show that higher-order LOLA (HOLA) solves LOLA's inconsistency problem if it converges. Second, we correct a claim made in th...
International audienceIn this paper, we examine the Nash equilibrium convergence properties of no-re...
A selfish learner seeks to maximize their own success, disregarding others. When success is measured...
Hindsight rationality is an approach to playing general-sum games that prescribes no-regret learning...
A growing number of learning methods are actually differentiable games whose players optimise multip...
A growing number of learning methods are actually differentiable games whose players optimise multip...
Gradient-based learning in multi-agent systems is difficult because the gradient derives from a firs...
Cheung and Piliouras (2020) recently showed that two variants of the Multiplicative Weights Update m...
Multi-agent settings are quickly gathering importance in machine learning. This includes a plethora ...
Cheung and Piliouras (2020) recently showed that two variants of the Multiplicative Weights Update m...
First published: 01 February 2019We study models of learning in games where agents with limited memo...
International audienceIn this paper, we examine the convergence rate of a wide range of regularized ...
Collaborative learning techniques have the potential to enable training machine learning models that...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
A machine learning (ML) system must learn not only to match the output of a target function on a tra...
Nash equilibrium can be interpreted as a steady state of a game where players hold correct beliefs a...
International audienceIn this paper, we examine the Nash equilibrium convergence properties of no-re...
A selfish learner seeks to maximize their own success, disregarding others. When success is measured...
Hindsight rationality is an approach to playing general-sum games that prescribes no-regret learning...
A growing number of learning methods are actually differentiable games whose players optimise multip...
A growing number of learning methods are actually differentiable games whose players optimise multip...
Gradient-based learning in multi-agent systems is difficult because the gradient derives from a firs...
Cheung and Piliouras (2020) recently showed that two variants of the Multiplicative Weights Update m...
Multi-agent settings are quickly gathering importance in machine learning. This includes a plethora ...
Cheung and Piliouras (2020) recently showed that two variants of the Multiplicative Weights Update m...
First published: 01 February 2019We study models of learning in games where agents with limited memo...
International audienceIn this paper, we examine the convergence rate of a wide range of regularized ...
Collaborative learning techniques have the potential to enable training machine learning models that...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
A machine learning (ML) system must learn not only to match the output of a target function on a tra...
Nash equilibrium can be interpreted as a steady state of a game where players hold correct beliefs a...
International audienceIn this paper, we examine the Nash equilibrium convergence properties of no-re...
A selfish learner seeks to maximize their own success, disregarding others. When success is measured...
Hindsight rationality is an approach to playing general-sum games that prescribes no-regret learning...