This thesis is concerned with the problem of how to make decisions in an uncertain world. We use a model of uncertainty based on Markov decision problems, and develop a number of algorithms for decision-making both for the planning problem, in which the model is known in advance, and for the reinforcement learning problem in which the decision-making agent does not know the model and must learn to make good decisions by trial and error. The basis for much of this work is the use of structural representations of problems. If a problem is represented in a structured way we can compute or learn plans that take advantage of this structure for computational gains. This is because the structure allows us to perform abstraction. Rather th...
This dissertation considers a particular aspect of sequential decision making under uncertainty in w...
This paper introduces a framework for Planning while Learning where an agent is given a goal to achi...
Decision-theoretic planning techniques are increasingly being used to obtain (optimal) plans for dom...
The problem of making decisions is ubiquitous in life. This problem becomes even more complex when t...
Reinforcement Learning has emerged as a useful framework for learning to perform a task optimally fr...
Sequential decision making, commonly formalized as optimization of a Markov Decision Process, is a k...
We investigate the use Markov Decision Processes a.s a means of representing worlds in which action...
Decision theory addresses the task of choosing an action; it provides robust decision-making criteri...
Replicating the human ability to solve complex planning problems based on minimal prior knowledge ha...
This thesis is about chance and choice, or decisions under uncertainty. The desire for creating an ...
An autonomous agent should make good decisions quickly. These two considerations --- effectiveness a...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
Deep, model based reinforcement learning has shown state of the art, human-exceeding performance in ...
Reinforcement learning systems are often concerned with balancing exploration of untested actions ag...
AbstractUncertainty, inherent in most real-world domains, can cause failure of apparently sound clas...
This dissertation considers a particular aspect of sequential decision making under uncertainty in w...
This paper introduces a framework for Planning while Learning where an agent is given a goal to achi...
Decision-theoretic planning techniques are increasingly being used to obtain (optimal) plans for dom...
The problem of making decisions is ubiquitous in life. This problem becomes even more complex when t...
Reinforcement Learning has emerged as a useful framework for learning to perform a task optimally fr...
Sequential decision making, commonly formalized as optimization of a Markov Decision Process, is a k...
We investigate the use Markov Decision Processes a.s a means of representing worlds in which action...
Decision theory addresses the task of choosing an action; it provides robust decision-making criteri...
Replicating the human ability to solve complex planning problems based on minimal prior knowledge ha...
This thesis is about chance and choice, or decisions under uncertainty. The desire for creating an ...
An autonomous agent should make good decisions quickly. These two considerations --- effectiveness a...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
Deep, model based reinforcement learning has shown state of the art, human-exceeding performance in ...
Reinforcement learning systems are often concerned with balancing exploration of untested actions ag...
AbstractUncertainty, inherent in most real-world domains, can cause failure of apparently sound clas...
This dissertation considers a particular aspect of sequential decision making under uncertainty in w...
This paper introduces a framework for Planning while Learning where an agent is given a goal to achi...
Decision-theoretic planning techniques are increasingly being used to obtain (optimal) plans for dom...