In inverse reinforcement learning an observer infers the reward distribution available for actions in the environment solely through observing the actions implemented by another agent. To address whether this computational process is implemented in the human brain, participants underwent fMRI while learning about slot machines yielding hidden preferred and non-preferred food outcomes with varying probabilities, through observing the repeated slot choices of agents with similar and dissimilar food preferences. Using formal model comparison, we found that participants implemented inverse RL as opposed to a simple imitation strategy, in which the actions of the other agent are copied instead of inferring the underlying reward structure of the ...
Reinforcement learning (RL) provides a framework involving two diverse approaches to reward-based de...
Artificial intelligence, particularly through recent advancements in deep learning, has achieved exc...
We adopt Markov Decision Processes (MDP) to model sequential decision problems, which have the chara...
In inverse reinforcement learning an observer infers the reward distribution available for actions i...
In observational learning (OL), organisms learn from observing the behavior of others. There are at ...
Theory of mind (ToM) is the psychological construct by which we model another’s internal mental stat...
This paper proposes model-free imitation learning named Entropy-Regularized Imitation Learning (ERIL...
While there is no doubt that social signals affect human reinforcement learning, there is still no c...
One of the primary mechanisms thought to underlie action selection in the brain is Reinforcement Lea...
The ability to integrate past and current feedback associated with di↵erent environmental stimuli is...
Attention and learning are cognitive control processes that are closely related. This thesis investi...
Theories of reward learning in neuroscience have focused on two families of algorithms thought to ca...
SummaryWhen an organism receives a reward, it is crucial to know which of many candidate actions cau...
A major challenge faced by machine learning community is the decision making problems under uncertai...
The aim of Inverse Reinforcement Learning (IRL) is to infer a reward function R from a policy pi. To...
Reinforcement learning (RL) provides a framework involving two diverse approaches to reward-based de...
Artificial intelligence, particularly through recent advancements in deep learning, has achieved exc...
We adopt Markov Decision Processes (MDP) to model sequential decision problems, which have the chara...
In inverse reinforcement learning an observer infers the reward distribution available for actions i...
In observational learning (OL), organisms learn from observing the behavior of others. There are at ...
Theory of mind (ToM) is the psychological construct by which we model another’s internal mental stat...
This paper proposes model-free imitation learning named Entropy-Regularized Imitation Learning (ERIL...
While there is no doubt that social signals affect human reinforcement learning, there is still no c...
One of the primary mechanisms thought to underlie action selection in the brain is Reinforcement Lea...
The ability to integrate past and current feedback associated with di↵erent environmental stimuli is...
Attention and learning are cognitive control processes that are closely related. This thesis investi...
Theories of reward learning in neuroscience have focused on two families of algorithms thought to ca...
SummaryWhen an organism receives a reward, it is crucial to know which of many candidate actions cau...
A major challenge faced by machine learning community is the decision making problems under uncertai...
The aim of Inverse Reinforcement Learning (IRL) is to infer a reward function R from a policy pi. To...
Reinforcement learning (RL) provides a framework involving two diverse approaches to reward-based de...
Artificial intelligence, particularly through recent advancements in deep learning, has achieved exc...
We adopt Markov Decision Processes (MDP) to model sequential decision problems, which have the chara...