<div><p>Transitive inference (the ability to infer that <i>B</i> > <i>D</i> given that <i>B</i> > <i>C</i> and <i>C</i> > <i>D</i>) is a widespread characteristic of serial learning, observed in dozens of species. Despite these robust behavioral effects, reinforcement learning models reliant on reward prediction error or associative strength routinely fail to perform these inferences. We propose an algorithm called <i>betasort</i>, inspired by cognitive processes, which performs transitive inference at low computational cost. This is accomplished by (1) representing stimulus positions along a unit span using beta distributions, (2) treating positive and negative feedback asymmetrically, and (3) updating the position of every stimulus during...
The exploration/exploitation tradeoff – pursuing a known reward vs. sampling from lesser known optio...
Theories of reward learning in neuroscience have focused on two families of algorithms thought to ca...
Primate errors in transitive ‘inference’: A two-tier learning model Received: date / Revised: date A...
Transitive inference (the ability to infer that “B> D ” given that “B> C ” and “C> D”) is a...
Transitive inference (the ability to infer that B> D given that B> C and C> D) is a wide-sp...
Olga F. Lazareva (Mentor)The ability of reinforcement-based models to predict transitive inference b...
<p>A schematic specification of the betasort algorithm over the course of one trial. Rectangles refe...
In order to survive and reproduce, individual animals need to navigate through a multidimensional ut...
Research on discrimination-based transitive inference (TI) has demonstrated a widespread capacity fo...
In the basic verbal task from Piaget, when a relation of the form if A > B and B > C is given, a log...
<p>(Top) Each session used a novel seven-item list, like the one depicted here. However, subjects we...
Transitive inference (TI) has been studied in humans and several animals such as rats, pigeons and f...
The goal of temporal difference (TD) reinforcement learning is to maximize outcomes and improve futu...
Humans and animals are capable of evaluating actions by considering their long-run future rewards th...
Humans and animals are capable of evaluating actions by considering their long-run future rewards th...
The exploration/exploitation tradeoff – pursuing a known reward vs. sampling from lesser known optio...
Theories of reward learning in neuroscience have focused on two families of algorithms thought to ca...
Primate errors in transitive ‘inference’: A two-tier learning model Received: date / Revised: date A...
Transitive inference (the ability to infer that “B> D ” given that “B> C ” and “C> D”) is a...
Transitive inference (the ability to infer that B> D given that B> C and C> D) is a wide-sp...
Olga F. Lazareva (Mentor)The ability of reinforcement-based models to predict transitive inference b...
<p>A schematic specification of the betasort algorithm over the course of one trial. Rectangles refe...
In order to survive and reproduce, individual animals need to navigate through a multidimensional ut...
Research on discrimination-based transitive inference (TI) has demonstrated a widespread capacity fo...
In the basic verbal task from Piaget, when a relation of the form if A > B and B > C is given, a log...
<p>(Top) Each session used a novel seven-item list, like the one depicted here. However, subjects we...
Transitive inference (TI) has been studied in humans and several animals such as rats, pigeons and f...
The goal of temporal difference (TD) reinforcement learning is to maximize outcomes and improve futu...
Humans and animals are capable of evaluating actions by considering their long-run future rewards th...
Humans and animals are capable of evaluating actions by considering their long-run future rewards th...
The exploration/exploitation tradeoff – pursuing a known reward vs. sampling from lesser known optio...
Theories of reward learning in neuroscience have focused on two families of algorithms thought to ca...
Primate errors in transitive ‘inference’: A two-tier learning model Received: date / Revised: date A...