34 pages, 6 figuresInternational audienceWe investigate a class of reinforcement learning dynamics in which each player plays a "regularized best response" to a score vector consisting of his actions' cumulative payoffs. Regularized best responses are single-valued regularizations of ordinary best responses obtained by maximizing the difference between a player's expected cumulative payoff and a (strongly) convex penalty term. In contrast to the class of smooth best response maps used in models of stochastic fictitious play, these penalty functions are not required to be infinitely steep at the boundary of the simplex; in fact, dropping this requirement gives rise to an important dichotomy between steep and nonsteep cases. In this general s...
ADInternational audienceConsider a two-player normal-form game repeated over time. We introduce an a...
ADInternational audienceConsider a two-player normal-form game repeated over time. We introduce an a...
ADInternational audienceConsider a two-player normal-form game repeated over time. We introduce an a...
Abstract. We investigate a class of reinforcement learning dynamics in which each player plays a “re...
Abstract. We investigate a class of reinforcement learning dynamics in which each player plays a “re...
34 pages, 6 figuresInternational audienceWe investigate a class of reinforcement learning dynamics i...
Recent models of learning in games have attempted to produce individual-level learning algorithms th...
Recent extensions to dynamic games of the well-known fictitious play learning procedure in static ga...
This paper studies the analytical properties of the reinforcement learning model proposed in Erev an...
International audienceRegularized learning is a fundamental technique in online optimization, machin...
International audienceIn this paper, we examine the Nash equilibrium convergence properties of no-re...
International audienceIn this paper, we examine the Nash equilibrium convergence properties of no-re...
This paper studies the analytical properties of the reinforcement learning model proposed in Erev an...
International audienceStarting from a heuristic learning scheme for N-person games, we derive a new ...
International audienceStarting from a heuristic learning scheme for N-person games, we derive a new ...
ADInternational audienceConsider a two-player normal-form game repeated over time. We introduce an a...
ADInternational audienceConsider a two-player normal-form game repeated over time. We introduce an a...
ADInternational audienceConsider a two-player normal-form game repeated over time. We introduce an a...
Abstract. We investigate a class of reinforcement learning dynamics in which each player plays a “re...
Abstract. We investigate a class of reinforcement learning dynamics in which each player plays a “re...
34 pages, 6 figuresInternational audienceWe investigate a class of reinforcement learning dynamics i...
Recent models of learning in games have attempted to produce individual-level learning algorithms th...
Recent extensions to dynamic games of the well-known fictitious play learning procedure in static ga...
This paper studies the analytical properties of the reinforcement learning model proposed in Erev an...
International audienceRegularized learning is a fundamental technique in online optimization, machin...
International audienceIn this paper, we examine the Nash equilibrium convergence properties of no-re...
International audienceIn this paper, we examine the Nash equilibrium convergence properties of no-re...
This paper studies the analytical properties of the reinforcement learning model proposed in Erev an...
International audienceStarting from a heuristic learning scheme for N-person games, we derive a new ...
International audienceStarting from a heuristic learning scheme for N-person games, we derive a new ...
ADInternational audienceConsider a two-player normal-form game repeated over time. We introduce an a...
ADInternational audienceConsider a two-player normal-form game repeated over time. We introduce an a...
ADInternational audienceConsider a two-player normal-form game repeated over time. We introduce an a...