peer reviewedReinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms come short of achieving this goal because the amount of exploration required is often too costly and/or too time consuming for online learning. As a result, RL is mostly used for offline learning in simulated environments. We propose a new algorithm, called BEETLE, for effective online learning that is computationally efficient while minimizing the amount of exploration. We take a Bayesian model-based approach, framing RL as a partially observable Markov decision process. Our two main contributions are the analytical derivation that the optimal value fun...
It is often assumed that agents in multiagent systems with state uncertainty have full knowledge of ...
This paper proposes an online tree-based I3ayesian approach for reinforcement learning. For inferenc...
AbstractActing in domains where an agent must plan several steps ahead to achieve a goal can be a ch...
Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an on...
Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an ...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
In this paper, we study the problem of efficient online reinforcement learning in the infinite horiz...
Artificial Intelligence (AI) has been an active field of research for over a century now. The resear...
Bayesian methods for reinforcement learning (RL) allow model uncertainty to be considered explicitly...
Reinforcement Learning has emerged as a useful framework for learning to perform a task optimally fr...
Reinforcement learning (RL) has gained an increasing interest in recent years, being expected to del...
This thesis explores Bayesian and variational inference in the context of solving the reinforcement ...
We address the problem of Bayesian reinforcement learning using efficient model-based online plannin...
For autonomous robots, we propose an approximate model-based Bayesian reinforcement learning (MB-BRL...
Thesis (Ph.D.)--University of Washington, 2020Informed and robust decision making in the face of unc...
It is often assumed that agents in multiagent systems with state uncertainty have full knowledge of ...
This paper proposes an online tree-based I3ayesian approach for reinforcement learning. For inferenc...
AbstractActing in domains where an agent must plan several steps ahead to achieve a goal can be a ch...
Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an on...
Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an ...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
In this paper, we study the problem of efficient online reinforcement learning in the infinite horiz...
Artificial Intelligence (AI) has been an active field of research for over a century now. The resear...
Bayesian methods for reinforcement learning (RL) allow model uncertainty to be considered explicitly...
Reinforcement Learning has emerged as a useful framework for learning to perform a task optimally fr...
Reinforcement learning (RL) has gained an increasing interest in recent years, being expected to del...
This thesis explores Bayesian and variational inference in the context of solving the reinforcement ...
We address the problem of Bayesian reinforcement learning using efficient model-based online plannin...
For autonomous robots, we propose an approximate model-based Bayesian reinforcement learning (MB-BRL...
Thesis (Ph.D.)--University of Washington, 2020Informed and robust decision making in the face of unc...
It is often assumed that agents in multiagent systems with state uncertainty have full knowledge of ...
This paper proposes an online tree-based I3ayesian approach for reinforcement learning. For inferenc...
AbstractActing in domains where an agent must plan several steps ahead to achieve a goal can be a ch...