Regret-minimizing strategies for repeated games have been receiving increasing attention in the literature. These are simple adaptive behavior rules that lead to no regrets and, if followed by all players, exhibit nice convergence properties: the average play converges to correlated equilibrium, or even to Nash equilibrium in certain classes of games. However, the no-regret property relies on a strong assumption that each player treats her opponents as unresponsive and fully ignores the opponents' possible reactions to her actions. We show that if at least one player is slightly responsive, it is impossible to achieve no regrets, and convergence results for regret minimization with responsive opponents are unknown.</p
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
International audienceWe examine the problem of regret minimization when the learner is involved in ...
International audienceWe examine the problem of regret minimization when the learner is involved in ...
Regret-minimizing strategies for repeated games have been receiving increasing attention in the lite...
Regret-minimizing strategies for repeated games have been receiving increasing attention in the lite...
Regret-minimizing strategies for repeated games have been receiving increasing attention in the lite...
Regret minimizing strategies for repeated games have been receiving increasing attention in the lite...
This paper considers repeated games in which one player has a different objective than others. In pa...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
We consider repeated games in which the player, instead of observing the action chosen by the oppone...
38 pagesIn this paper, we examine the long-term behavior of regret-minimizing agents in time-varying...
38 pagesIn this paper, we examine the long-term behavior of regret-minimizing agents in time-varying...
Our work considers repeated games in which one player has a different objective than others. In part...
Discounted-sum games provide a formal model for the study of reinforcement learning, where the agent...
We exhibit and characterize an entire class of simple adaptive strategies, in the repeated play of a...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
International audienceWe examine the problem of regret minimization when the learner is involved in ...
International audienceWe examine the problem of regret minimization when the learner is involved in ...
Regret-minimizing strategies for repeated games have been receiving increasing attention in the lite...
Regret-minimizing strategies for repeated games have been receiving increasing attention in the lite...
Regret-minimizing strategies for repeated games have been receiving increasing attention in the lite...
Regret minimizing strategies for repeated games have been receiving increasing attention in the lite...
This paper considers repeated games in which one player has a different objective than others. In pa...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
We consider repeated games in which the player, instead of observing the action chosen by the oppone...
38 pagesIn this paper, we examine the long-term behavior of regret-minimizing agents in time-varying...
38 pagesIn this paper, we examine the long-term behavior of regret-minimizing agents in time-varying...
Our work considers repeated games in which one player has a different objective than others. In part...
Discounted-sum games provide a formal model for the study of reinforcement learning, where the agent...
We exhibit and characterize an entire class of simple adaptive strategies, in the repeated play of a...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
International audienceWe examine the problem of regret minimization when the learner is involved in ...
International audienceWe examine the problem of regret minimization when the learner is involved in ...