According to a series of influential models, dopamine (DA) neurons sig-nal reward prediction error using a temporal-difference (TD) algorithm. We address a problem not convincingly solved in these accounts: how to maintain a representation of cues that predict delayed consequences. Our new model uses a TD rule grounded in partially observable semi-Markov processes, a formalism that captures two largely neglected features of DA experiments: hidden state and temporal variability. Previous models pre-dicted rewards using a tapped delay line representation of sensory inputs; we replace this with a more active process of inference about the under-lying state of the world. The DA system can then learn to map these inferred states to reward predic...
Abstract. The activity of dopaminergic (DA) neurons has been hypoth-esized to encode a reward predic...
Recent work has advanced our knowledge of phasic dopamine reward prediction error signals. The error...
Substantial data support a temporal difference (TD) model of dopamine (DA) neuron activity in which ...
Although the responses of dopamine neurons in the primate midbrain are well characterized as carryin...
Temporal-difference learning (TD) models explain most responses of primate dopamine neurons in appet...
This article focuses on recent modeling studies of dopamine neuron activity and their influence on b...
Midbrain dopamine neurons signal reward prediction error (RPE), or actual minus expected reward. The...
Does dopamine code for uncertainty (Fiorillo, Tobler & Schultz, 2003; 2005) or is the sustained ...
An open problem in the field of computational neuroscience is how to link synaptic plasticity to sys...
An open problem in the field of computational neuroscience is how to link synaptic plasticity to sys...
An open problem in the field of computational neuroscience is how to link synaptic plasticity to sys...
An open problem in the field of computational neuroscience is how to link synaptic plasticity to sys...
A cornerstone of current theorizing about dopamine’s computational role is the idea that the phasic ...
Substantial evidence suggests that the phasic activities of dopaminergic neurons in the primate midb...
SummaryReward prediction error (RPE) signals are central to current models of reward-learning. Tempo...
Abstract. The activity of dopaminergic (DA) neurons has been hypoth-esized to encode a reward predic...
Recent work has advanced our knowledge of phasic dopamine reward prediction error signals. The error...
Substantial data support a temporal difference (TD) model of dopamine (DA) neuron activity in which ...
Although the responses of dopamine neurons in the primate midbrain are well characterized as carryin...
Temporal-difference learning (TD) models explain most responses of primate dopamine neurons in appet...
This article focuses on recent modeling studies of dopamine neuron activity and their influence on b...
Midbrain dopamine neurons signal reward prediction error (RPE), or actual minus expected reward. The...
Does dopamine code for uncertainty (Fiorillo, Tobler & Schultz, 2003; 2005) or is the sustained ...
An open problem in the field of computational neuroscience is how to link synaptic plasticity to sys...
An open problem in the field of computational neuroscience is how to link synaptic plasticity to sys...
An open problem in the field of computational neuroscience is how to link synaptic plasticity to sys...
An open problem in the field of computational neuroscience is how to link synaptic plasticity to sys...
A cornerstone of current theorizing about dopamine’s computational role is the idea that the phasic ...
Substantial evidence suggests that the phasic activities of dopaminergic neurons in the primate midb...
SummaryReward prediction error (RPE) signals are central to current models of reward-learning. Tempo...
Abstract. The activity of dopaminergic (DA) neurons has been hypoth-esized to encode a reward predic...
Recent work has advanced our knowledge of phasic dopamine reward prediction error signals. The error...
Substantial data support a temporal difference (TD) model of dopamine (DA) neuron activity in which ...