ADInternational audienceConsider a two-player normal-form game repeated over time. We introduce an adaptive learning procedure, where the players only observe their own realized payoff at each stage. We assume that agents do not know their own payoff function and have no information on the other player. Furthermore, we assume that they have restrictions on their own actions such that, at each stage, their choice is limited to a subset of their action set. We prove that the empirical distributions of play converge to the set of Nash equilibria for zero-sum and potential games, and games where one player has two actions
Fudenberg and Kreps consider adaptive learning processes, in the spirit of fictitious play, for inf...
In this paper (reinforcement) learning of decision makers that face many different games is studied....
The single-agent multi-armed bandit problem can be solved by an agent that learns the values of each...
ADInternational audienceConsider a two-player normal-form game repeated over time. We introduce an a...
28 pagesConsider a 2-player normal-form game repeated over time. We introduce an adaptive learning p...
In this paper, we provide a theoretical prediction of the way in which adaptive players behave in th...
Hirsch [2], is called smooth fictitious play. Using techniques from stochastic approximation by the ...
In this paper, we address the problem of convergence to Nash equilibria in games with rewards that a...
This paper examines the convergence of payoffs and strategies in Erev and Roth's model of reinforcem...
We consider reinforcement learning algorithms in normal form games. Using two-timescales stochastic ...
Fudenberg and Kreps (1993) consider adaptive learning processes, in the spirit of ctitious play, for...
34 pages, 6 figuresInternational audienceWe investigate a class of reinforcement learning dynamics i...
International audienceWhile payoff-based learning models are almost exclusively devised for finite a...
Recent models of learning in games have attempted to produce individual-level learning algorithms th...
The present study focuses on a class of games with reinforcement-learning agents that adaptively cho...
Fudenberg and Kreps consider adaptive learning processes, in the spirit of fictitious play, for inf...
In this paper (reinforcement) learning of decision makers that face many different games is studied....
The single-agent multi-armed bandit problem can be solved by an agent that learns the values of each...
ADInternational audienceConsider a two-player normal-form game repeated over time. We introduce an a...
28 pagesConsider a 2-player normal-form game repeated over time. We introduce an adaptive learning p...
In this paper, we provide a theoretical prediction of the way in which adaptive players behave in th...
Hirsch [2], is called smooth fictitious play. Using techniques from stochastic approximation by the ...
In this paper, we address the problem of convergence to Nash equilibria in games with rewards that a...
This paper examines the convergence of payoffs and strategies in Erev and Roth's model of reinforcem...
We consider reinforcement learning algorithms in normal form games. Using two-timescales stochastic ...
Fudenberg and Kreps (1993) consider adaptive learning processes, in the spirit of ctitious play, for...
34 pages, 6 figuresInternational audienceWe investigate a class of reinforcement learning dynamics i...
International audienceWhile payoff-based learning models are almost exclusively devised for finite a...
Recent models of learning in games have attempted to produce individual-level learning algorithms th...
The present study focuses on a class of games with reinforcement-learning agents that adaptively cho...
Fudenberg and Kreps consider adaptive learning processes, in the spirit of fictitious play, for inf...
In this paper (reinforcement) learning of decision makers that face many different games is studied....
The single-agent multi-armed bandit problem can be solved by an agent that learns the values of each...