Meta-parameters in reinforcement learning should be tuned to the environmental dynamics and the animal performance. Here, we propose a biologically plausible meta-reinforcement learning algorithm for tuning these meta-parameters in a dynamic, adaptive manner. We tested our algorithm in both a simulation of a Markov decision task and in a non-linear control task. Our results show that the algorithm robustly finds appropriate meta-parameter values, and controls the meta-parameter time course, in both static and dynamic environments. We suggest that the phasic and tonic components of dopamine neuron firing can encode the signal required for meta-learning of reinforcement learning
Animals excel at adapting their intentions, attention, and actions to the environment, making them r...
Modern deep reinforcement learning (RL) algorithms, despite being at the forefront of artificial int...
In reinforcement learning, the duality between exploitation and exploration has long been an import...
Meta-reinforcement learning (meta-RL) algorithms enable agents to adapt quickly to tasks from few sa...
Conflictual cues and unexpected changes in human real-case scenarios may be detrimental to the execu...
Supplementary files for article Context meta-reinforcement learning via neuromodulation Meta-reinfor...
Deep Reinforcement Learning has demonstrated the potential of neural networks tuned with gradient de...
Meta-learning is a branch of machine learning which trains neural network models to synthesize a wid...
Meta-reinforcement learning has the potential to enable artificial agents to master new skills with ...
© 2019, Springer Nature Switzerland AG. In the last few years, we have witnessed a resurgence of int...
Meta-learning algorithms leverage regularities that are present on a set of tasks to speed up and im...
In meta-reinforcement learning, an agent is trained in multiple different environments and attempts ...
Many real-world domains are subject to a structured non-stationarity which affects the agent's goals...
The State of the Art of the young domain of Meta-Learning [3] is held by the connectionist approach....
Animals excel at adapting their intentions, attention, and actions to the environment, making them r...
Animals excel at adapting their intentions, attention, and actions to the environment, making them r...
Modern deep reinforcement learning (RL) algorithms, despite being at the forefront of artificial int...
In reinforcement learning, the duality between exploitation and exploration has long been an import...
Meta-reinforcement learning (meta-RL) algorithms enable agents to adapt quickly to tasks from few sa...
Conflictual cues and unexpected changes in human real-case scenarios may be detrimental to the execu...
Supplementary files for article Context meta-reinforcement learning via neuromodulation Meta-reinfor...
Deep Reinforcement Learning has demonstrated the potential of neural networks tuned with gradient de...
Meta-learning is a branch of machine learning which trains neural network models to synthesize a wid...
Meta-reinforcement learning has the potential to enable artificial agents to master new skills with ...
© 2019, Springer Nature Switzerland AG. In the last few years, we have witnessed a resurgence of int...
Meta-learning algorithms leverage regularities that are present on a set of tasks to speed up and im...
In meta-reinforcement learning, an agent is trained in multiple different environments and attempts ...
Many real-world domains are subject to a structured non-stationarity which affects the agent's goals...
The State of the Art of the young domain of Meta-Learning [3] is held by the connectionist approach....
Animals excel at adapting their intentions, attention, and actions to the environment, making them r...
Animals excel at adapting their intentions, attention, and actions to the environment, making them r...
Modern deep reinforcement learning (RL) algorithms, despite being at the forefront of artificial int...
In reinforcement learning, the duality between exploitation and exploration has long been an import...