Humans and animals face decision tasks in an uncertain multi-agent environment where an agent's strategy may change in time due to the co-adaptation of others strategies. The neuronal substrate and the computational algorithms underlying such adaptive decision making, however, is largely unknown. We propose a population coding model of spiking neurons with a policy gradient procedure that successfully acquires optimal strategies for classical game-theoretical tasks. The suggested population reinforcement learning reproduces data from human behavioral experiments for the blackjack and the inspector game. It performs optimally according to a pure (deterministic) and mixed (stochastic) Nash equilibrium, respectively. In contrast, temporal-diff...
We present a new algorithm for polynomial time learning of optimal behavior in stochastic games. Thi...
To gain insights into the neural basis of such adaptive decision-making processes, we investigated t...
In this paper, we address the problem of convergence to Nash equilibria in games with rewards that a...
Humans and animals face decision tasks in an uncertain multi-agent environment where an agent’s stra...
We investigate a recently proposed model for decision learning in a population of spiking neurons wh...
Learning by reinforcement is important in shaping animal behavior, and in particular in behavioral d...
Recent models of learning in games have attempted to produce individual-level learning algorithms th...
Abstract The Nash equilibrium concept has previously been shown to be an important tool to understan...
The unifying theme of this thesis is the design and analysis of adaptive procedures that are aimed a...
This paper addresses the question of whether neural networks (NNs), a realistic cognitive model of h...
Previous studies have shown that non-human primates can generate highly stochastic choice behaviour,...
Population coding is widely regarded as a key mechanism for achieving reliable behavioral decisions....
Learning behaviors in a multiagent environment is crucial for developing and adapting multiagent sys...
Correlated equilibrium (Aumann, 1974, 1987) is an important generalization of the Nash equilibrium c...
The Nash equilibrium, the main solution concept in analytical game theory, cannot make precise predi...
We present a new algorithm for polynomial time learning of optimal behavior in stochastic games. Thi...
To gain insights into the neural basis of such adaptive decision-making processes, we investigated t...
In this paper, we address the problem of convergence to Nash equilibria in games with rewards that a...
Humans and animals face decision tasks in an uncertain multi-agent environment where an agent’s stra...
We investigate a recently proposed model for decision learning in a population of spiking neurons wh...
Learning by reinforcement is important in shaping animal behavior, and in particular in behavioral d...
Recent models of learning in games have attempted to produce individual-level learning algorithms th...
Abstract The Nash equilibrium concept has previously been shown to be an important tool to understan...
The unifying theme of this thesis is the design and analysis of adaptive procedures that are aimed a...
This paper addresses the question of whether neural networks (NNs), a realistic cognitive model of h...
Previous studies have shown that non-human primates can generate highly stochastic choice behaviour,...
Population coding is widely regarded as a key mechanism for achieving reliable behavioral decisions....
Learning behaviors in a multiagent environment is crucial for developing and adapting multiagent sys...
Correlated equilibrium (Aumann, 1974, 1987) is an important generalization of the Nash equilibrium c...
The Nash equilibrium, the main solution concept in analytical game theory, cannot make precise predi...
We present a new algorithm for polynomial time learning of optimal behavior in stochastic games. Thi...
To gain insights into the neural basis of such adaptive decision-making processes, we investigated t...
In this paper, we address the problem of convergence to Nash equilibria in games with rewards that a...