Abstract. The activity of dopaminergic (DA) neurons has been hypothesized to encode a reward prediction error (RPE) which corresponds to the error signal in Temporal Difference (TD) learning algorithms. This hypothesis has been reinforced by numerous studies showing the relevance of TD learning algorithms to describe the role of basal ganglia in classical conditioning. However, recent recordings of DA neurons during multi-choice tasks raised contradictory interpretations on whether DA’s RPE signal is action dependent or not. Thus the precise TD algorithm (i.e. Actor-Critic, Q-learning or SARSA) that best describes DA signals remains unknown. Here we simulate and precisely analyze these TD algorithms on a multi-choice task performed by r...
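To make the distinction between the three candidate algorithms concrete, the sketch below computes the RPE for one transition under each rule, using the standard textbook forms of the TD error. It is an illustration only, not the simulation described in the abstract: the state names, action names, value tables, reward, and discount factor are all hypothetical.

```python
# Minimal sketch of the three TD error (RPE) definitions.
# All states, actions, and numeric values below are illustrative assumptions.

GAMMA = 0.9  # discount factor (assumed)


def rpe_actor_critic(V, s, r, s_next, gamma=GAMMA):
    """Action-independent RPE: depends only on state values V."""
    return r + gamma * V[s_next] - V[s]


def rpe_q_learning(Q, s, a, r, s_next, gamma=GAMMA):
    """Off-policy RPE: bootstraps on the best action available in s_next."""
    return r + gamma * max(Q[s_next].values()) - Q[s][a]


def rpe_sarsa(Q, s, a, r, s_next, a_next, gamma=GAMMA):
    """On-policy RPE: bootstraps on the action actually taken in s_next."""
    return r + gamma * Q[s_next][a_next] - Q[s][a]


# Hypothetical value tables for a two-state, two-action task.
V = {"s0": 0.5, "s1": 1.0}
Q = {
    "s0": {"left": 0.2, "right": 0.6},
    "s1": {"left": 1.0, "right": 0.4},
}

# One transition: action "left" in s0, no reward, then "right" in s1.
delta_ac = rpe_actor_critic(V, "s0", 0.0, "s1")
delta_q = rpe_q_learning(Q, "s0", "left", 0.0, "s1")
delta_sarsa = rpe_sarsa(Q, "s0", "left", 0.0, "s1", "right")

print(f"Actor-Critic RPE: {delta_ac:.2f}")
print(f"Q-learning RPE:   {delta_q:.2f}")
print(f"SARSA RPE:        {delta_sarsa:.2f}")
```

For the same transition the three rules give different errors: only the Actor-Critic error ignores which action was chosen, while Q-learning and SARSA differ in whether the next-state term uses the best action or the action actually taken.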
There is an odd irony associated with the by-now almost ineluctable tie between reinforcement learni...
Substantial evidence suggests that the phasic activities of dopaminergic neurons in the primate midb...
Temporal difference learning has been proposed as a model for Pavlovian conditioning, in which an an...
An open problem in the field of computational neuroscience is how to link synaptic plasticity to sys...
It has been proposed that the striatum plays a crucial role in learning to select appropriate action...
According to a series of influential models, dopamine (DA) neurons signal reward prediction error u...
This article focuses on recent modeling studies of dopamine neuron activity and their influence on b...
In this thesis work, we modelled the role of dopamine in learning and in the processes of action sel...
Midbrain dopamine neurons signal reward prediction error (RPE), or actual minus expected reward. The...
Actor-critic algorithms for reinforcement learning are achieving renewed popularity due to their go...
Temporal-difference learning (TD) models explain most responses of primate dopamine neurons in appet...
Does dopamine code for uncertainty (Fiorillo, Tobler & Schultz, 2003; 2005) or is the sustained ...
ABSTRACT: Temporal difference learning (TD) is a popular algorithm in machine learning. Two learning...
Summary: Reward prediction error (RPE) signals are central to current models of reward-learning. Tempo...