(A) Example of insufficient sampling depth. The serial position curve is constructed from the occurrence of error trials with particular cue arrangements. Task variations with a large cue set size (100 vs 5 cues; a 20x difference) require more sampling. Increasing the test sampling depth (5000 → 30000 → 100000 test samples) for the same network reveals recency-like behavior where none was apparent at the lowest depth (5000 test samples). (B) Recency-like behavior can also be masked by low sampling frequency for networks/tasks that progress rapidly from a random to a target-selective strategy. Increasing the sampling frequency (training points ∈ {0, 11, 20, 40, 50} → {0, 5, 11, 15, 20, 30, 40, 50} → {0, 5, 7, 9, 11, 13, 15, 20, 30, 40, 50} × 10^3) reveals recency-like beh...
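The effect of test sampling depth in panel (A) can be illustrated with a rough sketch. It assumes, purely for illustration, that test samples are split evenly across serial positions and that errors at each position are independent Bernoulli events. The names `binomial_standard_error`, `positions`, and `error_rate` are hypothetical and do not come from the source.

```python
from math import sqrt

def binomial_standard_error(p, n):
    """Standard error of a proportion p estimated from n independent trials."""
    return sqrt(p * (1.0 - p) / n)

# Assumed values for illustration only (not taken from the source).
positions = 5        # number of serial positions in the curve
error_rate = 0.05    # assumed true error rate at one position

# The three test sampling depths named in the caption.
for depth in (5000, 30000, 100000):
    per_position = depth // positions
    se = binomial_standard_error(error_rate, per_position)
    print(f"depth={depth:>6}  samples/position={per_position:>5}  SE={se:.4f}")
```

Under these assumptions the uncertainty on each point of the curve shrinks with the square root of the sample count, so a small position-dependent difference that is lost in noise at 5000 samples may become resolvable at 100000.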
Lake and Baroni (2018) introduced the SCAN dataset probing the ability of seq2seq models to captur...
When a large feedforward neural network is trained on a small training set, it typically performs po...
Residuals of serial position curve linear fits (linear fit residual) as described in Wittig et al. 2...
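A minimal sketch of how a slope and a linear fit residual could be computed from a serial position curve. It assumes the residual is the root-mean-square deviation of the curve from its least-squares line; the exact definition used in the source follows Wittig et al. and may differ. The function name `linear_fit_summary` and the example curves are hypothetical.

```python
from math import sqrt

def linear_fit_summary(error_rates):
    """Least-squares line through a serial position curve.

    Returns (slope, rms_residual), with serial positions numbered 1..n.
    Requires at least two positions.
    """
    n = len(error_rates)
    xs = list(range(1, n + 1))
    x_mean = sum(xs) / n
    y_mean = sum(error_rates) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, error_rates))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    squared = [(y - (slope * x + intercept)) ** 2 for x, y in zip(xs, error_rates)]
    return slope, sqrt(sum(squared) / n)

# Hypothetical curves (error rate per serial position), not data from the source.
recency_like = [0.50, 0.40, 0.30, 0.20, 0.10]   # errors fall toward recent cues
non_linear = [0.10, 0.30, 0.40, 0.30, 0.10]     # symmetric, poorly fit by a line

print(linear_fit_summary(recency_like))
print(linear_fit_summary(non_linear))
```

In this sketch a steep slope with a near-zero residual describes a curve that is well captured by a line, while a flat slope with a large residual flags structure that the linear fit misses.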
(A) The serial position curve exhibits specific characteristics depending on strategy. Left panels show...
(A) Activity states of an RNN trained on 50k episodes after first three cues. Coloring by first cue ...
Slope and residual plots reveal clear instances where RNNs progress from random (low performance, sm...
(A) An example of five cues encoded using a non-similarity based method, each equal magnitude and un...
(A) PCA of network activity after presentation of all cues in a trial for trials answered correctly,...
(A) Slope vs performance plot for RNNs trained with a reward scheme that explicitly rewards recency ...
(A) Serial position curve error profiles reveal similar uneven and directional trends as with the or...
Performance over eight variations of three working memory tasks from Wittig et al. 2016. Across all ...
(A) Match errors plot similar to Fig 3B: A in sequences shown at the top represents the cue of inter...
<p>Figure shows a plot of the number of networks correctly identified (as of total) with decreasing...
(A) PCA of network activity after presentation of all cues in a trial for trials answered correctly,...
Network activity trajectories on test trials for three levels of training (trained on 0k, 20k, 50k t...