(A) Mean performance of Q-learning on non-terminal pairs throughout training and testing, sorted by symbolic distance. (B) Mean performance of Q-learning for each of the 21 pairs at transfer, sorted by symbolic distance. Critical pairs are shaded in gray. (C) Mean performance of Q-learning for each of the 21 pairs at the end of training. (D) to (F) As above, but reporting simulated performance of the Value Transfer Model (VTM). (G) to (I) As above, but reporting simulated performance of RL-REMERGE.
<p>Three phases were included for each algorithm: 200 trials of adjacent pairs only, followed by 200...
<p>Estimates compare performance by subjects (blue lines) to those generated by simulations using ea...
We study the sample complexity of teaching, termed the "teaching dimension" (TDim) in the literature...
During training, two 5-item lists were trained (adjacent pairs only). In the Linking condition, acto...
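The training protocol described above can be sketched as follows. This is a minimal illustration, not the study's actual stimulus code: the item labels, the choice of linking premise, and the trial-sampling procedure are all assumptions made for the example.

```python
import random

def adjacent_pairs(items):
    """All adjacent premise pairs for one ordered list (higher-ranked item first)."""
    return [(items[i], items[i + 1]) for i in range(len(items) - 1)]

# Two hypothetical 5-item lists (labels are illustrative, not from the study).
list1 = ["A", "B", "C", "D", "E"]
list2 = ["F", "G", "H", "I", "J"]

# Training presents adjacent pairs only, as in the text.
training_pairs = adjacent_pairs(list1) + adjacent_pairs(list2)

# In a Linking condition, one premise joining the two lists (here assumed
# to be E > F) would allow them to merge into a single 10-item order.
linking_pair = ("E", "F")

def sample_trial(pairs, rng=random):
    """Draw one training trial: a premise pair in random left/right order."""
    hi, lo = rng.choice(pairs)
    left, right = (hi, lo) if rng.random() < 0.5 else (lo, hi)
    return left, right, hi  # hi is the rewarded choice
```

Each 5-item list contributes four adjacent premises, so training draws from eight pairs (nine in the Linking condition).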
(A) Mean performance of RL-Elo on non-terminal pairs throughout training and testing, sorted by symb...
Previous studies have shown that training a reinforcement model for the sorting problem takes very l...
In this paper we examine the behaviour of one such model-free algorithm, Q(λ) [2]. This algorithm shows ...
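To make the model-free setting concrete, here is a minimal tabular Q-learning sketch for a pairwise-choice task of this kind. It is a simplified stand-in, not the paper's Q(λ) implementation: it omits eligibility traces, and it assumes a per-item value table and an ε-greedy policy, neither of which is specified in the excerpt above.

```python
import random
from collections import defaultdict

def q_learning_pair_task(pairs, n_trials=1000, alpha=0.1, epsilon=0.1, seed=0):
    """Tabular Q-learning sketch for a pairwise stimulus-choice task.

    Each trial presents one premise pair (hi, lo); choosing the
    higher-ranked item is rewarded.  For simplicity, a single value is
    learned per stimulus -- one possible state encoding among several.
    """
    rng = random.Random(seed)
    Q = defaultdict(float)  # learned value of choosing each stimulus
    for _ in range(n_trials):
        hi, lo = rng.choice(pairs)
        options = [hi, lo]
        if rng.random() < epsilon:
            choice = rng.choice(options)          # explore
        else:
            choice = max(options, key=lambda s: Q[s])  # exploit
        reward = 1.0 if choice == hi else 0.0
        # Single-step (bandit-style) update toward the obtained reward.
        Q[choice] += alpha * (reward - Q[choice])
    return Q
```

Trained on adjacent pairs only, such a learner acquires high values for items near the top of the list and low values near the bottom, which is the kind of value gradient that value-transfer accounts exploit to explain transfer performance.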
<p>Performance on non-terminal stimulus pairs (i.e. those excluding stimuli <em>A</em> and <em>G</em...
<p>Simulated response accuracy for all stimulus pairs of a seven-item list using betasort (red), bet...
This paper describes several new online model-free reinforcement learning (RL) algorithms. We design...
The trial-and-error learning task was performed by N = 85 subjects. For each subject, it was tested ...