The paper investigates a stochastic model where two agents (persons, companies, institutions, states, software agents or other) learn interactive behavior in a series of alternating moves. Each agent is assumed to perform "stimulus-response-consequence " learning, as studied in psychology. In the presented model, the response of one agent o the other agent's move is both the stimulus for the other agent's next move and part of the consequence for the other agent's previous move. After deriving general properties of the model, especially concerning convergence to limit cycles, we concentrate on an asymptotic case where the learning rate tends to zero ("slow learning"). In this case, the dynamics can be desc...
This paper considers a class of reinforcement-based learning (namely, perturbed learning automata) a...
Recent models of learning in games have attempted to produce individual-level learning algorithms th...
This paper addresses a mathematically tractable model of the Prisoner's Dilemma using the framework ...
The present study focuses on a class of games with reinforcement-learning agents that adaptively cho...
AbstractHumans and other animals can adapt their social behavior in response to environmental cues i...
A population of agents plays a stochastic dynamic game wherein there is an underlying state process ...
Learning behaviors in a multiagent environment is crucial for developing and adapting multiagent sys...
Algorithmically designed reward functions can influence groups of learning agents toward measurable ...
This paper investigates the problem of policy learn-ing in multiagent environments using the stochas...
Do boundedly rational players learn to choose equilibrium strategies as they play a game repeatedly?...
Abstract: This paper studies the analytical properties of the reinforcement learning model proposed ...
AbstractLearning to act in a multiagent environment is a difficult problem since the normal definiti...
We present a new algorithm for polynomial time learning of optimal behavior in stochastic games. Thi...
This paper studies the analytical properties of the reinforcement learning model proposed in Erev an...
Consider a game that is played repeatedly by two populations of agents. In fictitious play, agents l...
This paper considers a class of reinforcement-based learning (namely, perturbed learning automata) a...
Recent models of learning in games have attempted to produce individual-level learning algorithms th...
This paper addresses a mathematically tractable model of the Prisoner's Dilemma using the framework ...
The present study focuses on a class of games with reinforcement-learning agents that adaptively cho...
AbstractHumans and other animals can adapt their social behavior in response to environmental cues i...
A population of agents plays a stochastic dynamic game wherein there is an underlying state process ...
Learning behaviors in a multiagent environment is crucial for developing and adapting multiagent sys...
Algorithmically designed reward functions can influence groups of learning agents toward measurable ...
This paper investigates the problem of policy learn-ing in multiagent environments using the stochas...
Do boundedly rational players learn to choose equilibrium strategies as they play a game repeatedly?...
Abstract: This paper studies the analytical properties of the reinforcement learning model proposed ...
AbstractLearning to act in a multiagent environment is a difficult problem since the normal definiti...
We present a new algorithm for polynomial time learning of optimal behavior in stochastic games. Thi...
This paper studies the analytical properties of the reinforcement learning model proposed in Erev an...
Consider a game that is played repeatedly by two populations of agents. In fictitious play, agents l...
This paper considers a class of reinforcement-based learning (namely, perturbed learning automata) a...
Recent models of learning in games have attempted to produce individual-level learning algorithms th...
This paper addresses a mathematically tractable model of the Prisoner's Dilemma using the framework ...