How do animals learn to repeat behaviors that lead to the obtention of food or other “rewarding” objects? As a biologically plausible paradigm for learning in spiking neural networks, spike-timing dependent plasticity (STDP) has been shown to perform well in unsupervised learning tasks such as receptive field development. However, STDP fails to take behavioral relevance into account, and as such is inadequate to explain a vast range of learning tasks in which the final outcome, conditioned on the prior execution of a series of actions, is signaled to an animal through sparse rewards. In this thesis, I show that the addition of a third, global, reward-based factor to the pre- and postsynaptic factors of STDP is a promising solution to this p...
Reward-modulated spike timing dependent plasticity (STDP) combines unsupervised STDP with a reinforc...
Learning agents, whether natural or artificial, must update their internal parameters in order to im...
Abstract Organisms are able to learn from reward and punishment to cope with unknown situations, in ...
Recent experiments have shown that spike-timing-dependent plasticity is influenced by neuromodulatio...
Animals repeat rewarded behaviors, but the physiological basis of reward-based learning has only bee...
Animals repeat rewarded behaviors, but the physiological basis of reward-based learning has only bee...
Reward-modulated spike-timing-dependent plasticity (STDP) has recently emerged as a candidate for a ...
Animals repeat rewarded behaviors, but the physiological basis of reward-based learning has only bee...
The persistent modification of synaptic efficacy as a function of the rela-tive timing of pre- and p...
Biological neurons communicate primarily via a spiking process. Recurrently connected spiking neural...
A fundamental goal of neuroscience is to understand how cognitive processes, such as operant conditi...
Spike timing-dependent plasticity (STDP) is under neuromodulatory control, which is correlated with ...
A fundamental goal of neuroscience is to understand how cognitive processes, such as operant conditi...
Reward-modulated spike timing dependent plasticity (STDP) combines unsupervised STDP with a reinforc...
The basal ganglia (BG), and more specifically the striatum, have long been proposed to play an essen...
Reward-modulated spike timing dependent plasticity (STDP) combines unsupervised STDP with a reinforc...
Learning agents, whether natural or artificial, must update their internal parameters in order to im...
Abstract Organisms are able to learn from reward and punishment to cope with unknown situations, in ...
Recent experiments have shown that spike-timing-dependent plasticity is influenced by neuromodulatio...
Animals repeat rewarded behaviors, but the physiological basis of reward-based learning has only bee...
Animals repeat rewarded behaviors, but the physiological basis of reward-based learning has only bee...
Reward-modulated spike-timing-dependent plasticity (STDP) has recently emerged as a candidate for a ...
Animals repeat rewarded behaviors, but the physiological basis of reward-based learning has only bee...
The persistent modification of synaptic efficacy as a function of the rela-tive timing of pre- and p...
Biological neurons communicate primarily via a spiking process. Recurrently connected spiking neural...
A fundamental goal of neuroscience is to understand how cognitive processes, such as operant conditi...
Spike timing-dependent plasticity (STDP) is under neuromodulatory control, which is correlated with ...
A fundamental goal of neuroscience is to understand how cognitive processes, such as operant conditi...
Reward-modulated spike timing dependent plasticity (STDP) combines unsupervised STDP with a reinforc...
The basal ganglia (BG), and more specifically the striatum, have long been proposed to play an essen...
Reward-modulated spike timing dependent plasticity (STDP) combines unsupervised STDP with a reinforc...
Learning agents, whether natural or artificial, must update their internal parameters in order to im...
Abstract Organisms are able to learn from reward and punishment to cope with unknown situations, in ...