Bayesian Reinforcement Learning has generated substantial interest recently, as it provides an elegant solution to the exploration-exploitation trade-off in reinforce-ment learning. However most investigations of Bayesian reinforcement learning to date focus on the standard Markov Decision Processes (MDPs). Our goal is to extend these ideas to the more general Partially Observable MDP (POMDP) framework, where the state is a hidden variable. To address this problem, we in-troduce a new mathematical model, the Bayes-Adaptive POMDP. This new model allows us to (1) improve knowledge of the POMDP domain through interaction with the environment, and (2) plan optimal sequences of actions which can trade-off between improving the model, identifying...
Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning contr...
We consider an autonomous agent operating in a stochastic, partially-observable, multiagent environm...
Bayesian methods for reinforcement learning (BRL) allow model uncertainty to be considered explicitl...
In recent work, Bayesian methods for exploration in Markov decision processes (MDPs) and for solving...
AbstractActing in domains where an agent must plan several steps ahead to achieve a goal can be a ch...
Model-based Bayesian Reinforcement Learning (BRL) provides a principled solution to dealing with the...
Acting in domains where an agent must plan several steps ahead to achieve a goal can be a challengin...
The Partially Observable Markov Decision Process (POMDP) framework has proven useful in planning dom...
In this paper, we describe how techniques from reinforcement learning might be used to approach the ...
Partially observable Markov decision processes (POMDPs) are interesting because they provide a gener...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
Abstract. We consider the active learning problem of inferring the transition model of a Markov Deci...
People are efficient when they make decisions under uncertainty, even when their decisions have long...
Much of reinforcement learning theory is built on top of oracles that are computationally hard to im...
This dissertation considers a particular aspect of sequential decision making under uncertainty in w...
Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning contr...
We consider an autonomous agent operating in a stochastic, partially-observable, multiagent environm...
Bayesian methods for reinforcement learning (BRL) allow model uncertainty to be considered explicitl...
In recent work, Bayesian methods for exploration in Markov decision processes (MDPs) and for solving...
AbstractActing in domains where an agent must plan several steps ahead to achieve a goal can be a ch...
Model-based Bayesian Reinforcement Learning (BRL) provides a principled solution to dealing with the...
Acting in domains where an agent must plan several steps ahead to achieve a goal can be a challengin...
The Partially Observable Markov Decision Process (POMDP) framework has proven useful in planning dom...
In this paper, we describe how techniques from reinforcement learning might be used to approach the ...
Partially observable Markov decision processes (POMDPs) are interesting because they provide a gener...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
Abstract. We consider the active learning problem of inferring the transition model of a Markov Deci...
People are efficient when they make decisions under uncertainty, even when their decisions have long...
Much of reinforcement learning theory is built on top of oracles that are computationally hard to im...
This dissertation considers a particular aspect of sequential decision making under uncertainty in w...
Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning contr...
We consider an autonomous agent operating in a stochastic, partially-observable, multiagent environm...
Bayesian methods for reinforcement learning (BRL) allow model uncertainty to be considered explicitl...