Humans can learn under a wide variety of feedback conditions. Reinforcement learning (RL), where a series of rewarded decisions must be made, is a particularly important type of learning. Computational and behavioral studies of RL have focused mainly on Markovian decision processes, where the next state depends on only the current state and action. Little is known about non-Markovian decision making, where the next state depends on more than the current state and action. Learning is non-Markovian, for example, when there is no unique mapping between actions and feedback. We have produced a model based on spiking neurons that can handle these non-Markovian conditions by performing policy gradient descent. Here, we examine the model’s perform...
AbstractIt is widely acknowledged that biological beings (animals) are not Markov: modelers generall...
General purpose intelligent learning agents cycle through (complex,non-MDP) sequences of observatio...
Theories of reward learning in neuroscience have focused on two families of algorithms thought to ca...
<div><p>Humans can learn under a wide variety of feedback conditions. Reinforcement learning (RL), w...
Humans and animals are capable of evaluating actions by considering their long-run future rewards th...
This thesis investigates mechanisms of human decision making, building on the fields of psychology a...
Following the novel paradigm developed by Van Roy and coauthors for reinforcement learning in arbitr...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
Computational models of learning have proved largely successful in characterizing potential mechanis...
Artificial agents have often been compared to humans in their ability to categorize images or play s...
Reinforcement learning (RL) studies the problem where an agent maximizes its cumulative reward throu...
We investigate learning in a setting where each period a population has to choose between two action...
SummaryReinforcement learning (RL) uses sequential experience with situations (“states”) and outcome...
Many of the decisions we make in our everyday lives are sequential and entail sparse rewards. While ...
In this paper, the question how spiking neural network (SNN) learns and fixes in its internal struct...
AbstractIt is widely acknowledged that biological beings (animals) are not Markov: modelers generall...
General purpose intelligent learning agents cycle through (complex,non-MDP) sequences of observatio...
Theories of reward learning in neuroscience have focused on two families of algorithms thought to ca...
<div><p>Humans can learn under a wide variety of feedback conditions. Reinforcement learning (RL), w...
Humans and animals are capable of evaluating actions by considering their long-run future rewards th...
This thesis investigates mechanisms of human decision making, building on the fields of psychology a...
Following the novel paradigm developed by Van Roy and coauthors for reinforcement learning in arbitr...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
Computational models of learning have proved largely successful in characterizing potential mechanis...
Artificial agents have often been compared to humans in their ability to categorize images or play s...
Reinforcement learning (RL) studies the problem where an agent maximizes its cumulative reward throu...
We investigate learning in a setting where each period a population has to choose between two action...
SummaryReinforcement learning (RL) uses sequential experience with situations (“states”) and outcome...
Many of the decisions we make in our everyday lives are sequential and entail sparse rewards. While ...
In this paper, the question how spiking neural network (SNN) learns and fixes in its internal struct...
AbstractIt is widely acknowledged that biological beings (animals) are not Markov: modelers generall...
General purpose intelligent learning agents cycle through (complex,non-MDP) sequences of observatio...
Theories of reward learning in neuroscience have focused on two families of algorithms thought to ca...