Understanding how individuals learn in an unknown environment is an important problem in economics. We model and examine experimentally behavior in a very simple multi-armed bandit framework in which participants do not know the inter-temporal payoff structure. We propose a baseline reinforcement learning model that allows for pattern-recognition and change in the strategy space. We also analyse three augmented versions that accommodate observational learning from the actions and/or payoffs of another player. The models successfully reproduce the distributional properties of observed discovery times and total payoffs. Our study further shows that when one of the pair discovers the hidden pattern, observing another’s actions and/or payoffs i...
The authors examine learning in all experiments they could locate involving one hundred periods or m...
In traditional Reinforcement Learning (RL) [4], a single agent learns to act in an environment by op...
In this research we study the statistical mechanics of cooperation through a simple case of aspirati...
International audienceUnderstanding how individuals learn in an unknown environment is an important ...
International audienceUnderstanding how individuals learn in an unknown environment is an important ...
The literature on learning in unknown environments emphasises reinforcing on actions which produce p...
Imitation learning has been widely used to speed up learning in novice agents, by allowing them to l...
How do humans search for rewards? This question is commonly studied using multi-armed bandit tasks, ...
How do people learn? We assess, in a model-free manner, subjectsʼ belief dynamics in a two-armed ban...
An n- armed bandit task was used to investigate the trade-off between exploratory (...
International audienceReinforcement learning (RL) is a paradigm for learning sequential decision mak...
Our joint paper, with Romuald Elie and Carl Remlinger entitled Reinforcement Learning in Economics a...
In this paper, we study the learning behavior possibly emerging in six series of prediction market e...
We investigate learning in a setting where each period a population has to choose between two action...
How people achieve long-term goals in an imperfectly known environment, via repeated tries and noisy...
The authors examine learning in all experiments they could locate involving one hundred periods or m...
In traditional Reinforcement Learning (RL) [4], a single agent learns to act in an environment by op...
In this research we study the statistical mechanics of cooperation through a simple case of aspirati...
International audienceUnderstanding how individuals learn in an unknown environment is an important ...
International audienceUnderstanding how individuals learn in an unknown environment is an important ...
The literature on learning in unknown environments emphasises reinforcing on actions which produce p...
Imitation learning has been widely used to speed up learning in novice agents, by allowing them to l...
How do humans search for rewards? This question is commonly studied using multi-armed bandit tasks, ...
How do people learn? We assess, in a model-free manner, subjectsʼ belief dynamics in a two-armed ban...
An n- armed bandit task was used to investigate the trade-off between exploratory (...
International audienceReinforcement learning (RL) is a paradigm for learning sequential decision mak...
Our joint paper, with Romuald Elie and Carl Remlinger entitled Reinforcement Learning in Economics a...
In this paper, we study the learning behavior possibly emerging in six series of prediction market e...
We investigate learning in a setting where each period a population has to choose between two action...
How people achieve long-term goals in an imperfectly known environment, via repeated tries and noisy...
The authors examine learning in all experiments they could locate involving one hundred periods or m...
In traditional Reinforcement Learning (RL) [4], a single agent learns to act in an environment by op...
In this research we study the statistical mechanics of cooperation through a simple case of aspirati...