Model-based Reinforcement Learning (MBRL) holds promise for data-efficiency by planning with model-generated experience in addition to learning with experience from the environment. However, in complex or changing environments, models in MBRL will inevitably be imperfect, and their detrimental effects on learning can be difficult to mitigate. In this work, we question whether the objective of these models should be the accurate simulation of environment dynamics at all. We focus our investigations on Dyna-style planning in a prediction setting. First, we highlight and support three motivating points: a perfectly accurate model of environment dynamics is not practically achievable, is not necessary, and is not always the most useful anyways....
Planning and reinforcement learning are two key approaches to sequential decision making. Multi-step...
The practical application of learning agents requires sample efficient and interpretable algorithms....
Model-based reinforcement learning has attracted wide attention due to its superior sample efficienc...
Model-based reinforcement learning (MBRL) is believed to have much higher sample efficiency compared...
Model-based reinforcement learning (MBRL) has often been touted for its potential to improve on the ...
Model-Based Reinforcement Learning (MBRL) algorithms solve sequential decision-making problems, usua...
When an agent cannot represent a perfectly accurate model of its environment's dynamics, model-based...
In meta-reinforcement learning, an agent is trained in multiple different environments and attempts ...
Humans can develop their internal model of the external world and use it for decision making. Reinfo...
This thesis studies the problem of learning a model in Model-Based Reinforcement Learning (MBRL). We...
<div><p>Many accounts of decision making and reinforcement learning posit the existence of two disti...
Formal exploration approaches in model-based reinforcement learning estimate the accuracy of the cur...
Model-based reinforcement learning (RL) methods attempt to learn a dynamics model to simulate the re...
Model-based reinforcement learning, in which a model of the environment's dynamics is learned a...
Formal exploration approaches in model-based reinforcement learning estimate the accuracy of the cur...
Planning and reinforcement learning are two key approaches to sequential decision making. Multi-step...
The practical application of learning agents requires sample efficient and interpretable algorithms....
Model-based reinforcement learning has attracted wide attention due to its superior sample efficienc...
Model-based reinforcement learning (MBRL) is believed to have much higher sample efficiency compared...
Model-based reinforcement learning (MBRL) has often been touted for its potential to improve on the ...
Model-Based Reinforcement Learning (MBRL) algorithms solve sequential decision-making problems, usua...
When an agent cannot represent a perfectly accurate model of its environment's dynamics, model-based...
In meta-reinforcement learning, an agent is trained in multiple different environments and attempts ...
Humans can develop their internal model of the external world and use it for decision making. Reinfo...
This thesis studies the problem of learning a model in Model-Based Reinforcement Learning (MBRL). We...
<div><p>Many accounts of decision making and reinforcement learning posit the existence of two disti...
Formal exploration approaches in model-based reinforcement learning estimate the accuracy of the cur...
Model-based reinforcement learning (RL) methods attempt to learn a dynamics model to simulate the re...
Model-based reinforcement learning, in which a model of the environment's dynamics is learned a...
Formal exploration approaches in model-based reinforcement learning estimate the accuracy of the cur...
Planning and reinforcement learning are two key approaches to sequential decision making. Multi-step...
The practical application of learning agents requires sample efficient and interpretable algorithms....
Model-based reinforcement learning has attracted wide attention due to its superior sample efficienc...