Equipping artificial agents with useful exploration mechanisms remains a challenge to this day. Humans, on the other hand, seem to manage the trade-off between exploration and exploitation effortlessly. In the present article, we put forward the hypothesis that they accomplish this by making optimal use of limited computational resources. We study this hypothesis by meta-learning reinforcement learning algorithms that sacrifice performance for a shorter description length. The emerging class of models captures human exploration behavior better than previously considered approaches, such as Boltzmann exploration, upper confidence bound algorithms, and Thompson sampling. We additionally demonstrate that changes in description length produce t...
International audienceTaking inspiration from neural principles of decision-makingis of particular i...
tatsujit[at]mail.dendai.ac.jp In an uncertain environment, decision-making meets two opposing demand...
International audienceTaking inspiration from neural principles of decision-makingis of particular i...
Equipping artificial agents with useful exploration mechanisms remains a challenge to this day. Huma...
Equipping artificial agents with useful exploration mechanisms remains a challenge to this day. Huma...
Balancing exploration and exploitation is one of the central problems in reinforcement learning. We ...
International audienceFast adaptation to changes in the environment requires both natural and artifi...
International audienceFast adaptation to changes in the environment requires both natural and artifi...
Many types of intelligent behavior can be framed as a search problem, where an individual must explo...
Many types of intelligent behavior can be framed as a search problem, where an individual must explo...
In reinforcement learning, the duality between exploitation and exploration has long been an import...
learning from past experiences helps orient the exploration of unknown environments. yet how we lear...
Despite their apparent importance for the acquisition of full-fledged human intelligence, mechanisms...
Biological brains are inherently limited in their capacity to process and store information, but are...
Biological brains are inherently limited in their capacity to process and store information, but are...
International audienceTaking inspiration from neural principles of decision-makingis of particular i...
tatsujit[at]mail.dendai.ac.jp In an uncertain environment, decision-making meets two opposing demand...
International audienceTaking inspiration from neural principles of decision-makingis of particular i...
Equipping artificial agents with useful exploration mechanisms remains a challenge to this day. Huma...
Equipping artificial agents with useful exploration mechanisms remains a challenge to this day. Huma...
Balancing exploration and exploitation is one of the central problems in reinforcement learning. We ...
International audienceFast adaptation to changes in the environment requires both natural and artifi...
International audienceFast adaptation to changes in the environment requires both natural and artifi...
Many types of intelligent behavior can be framed as a search problem, where an individual must explo...
Many types of intelligent behavior can be framed as a search problem, where an individual must explo...
In reinforcement learning, the duality between exploitation and exploration has long been an import...
learning from past experiences helps orient the exploration of unknown environments. yet how we lear...
Despite their apparent importance for the acquisition of full-fledged human intelligence, mechanisms...
Biological brains are inherently limited in their capacity to process and store information, but are...
Biological brains are inherently limited in their capacity to process and store information, but are...
International audienceTaking inspiration from neural principles of decision-makingis of particular i...
tatsujit[at]mail.dendai.ac.jp In an uncertain environment, decision-making meets two opposing demand...
International audienceTaking inspiration from neural principles of decision-makingis of particular i...