This dissertation considers a particular aspect of sequential decision making under uncertainty in which, at each stage, a decision-making agent operating in an uncertain world takes an action that elicits a reinforcement signal and causes the state of the world to change. Optimal learning is a pattern of behavior that yields the highest expected total reward over the entire duration of an agent\u27s interaction with its uncertain world. The problem of determining an optimal learning strategy is a sort of meta-problem, with optimality defined with respect to a distribution of environments that the agent is likely to encounter. Given this prior uncertainty over possible environments, the optimal-learning agent must collect and use informatio...
Research on the implications of learning-by-doing has typically been restricted to specifications of...
In this work, we address risk-averse Bayes-adaptive reinforcement learning. We pose the problem of o...
The exploration/exploitation dilemma is a fundamental but often computationally intractable problem ...
In this work we consider probabilistic approaches to sequential decision making. The ultimate goal i...
We consider the problem of "optimal learning" for Markov decision processes with uncertain...
Markov Decision Processes are a mathematical framework widely used for stochastic optimization and c...
We consider the dilemma of taking sequential action within a nebulous and costly stochastic system. ...
Sequential decision making is a fundamental task faced by any intelligent agent in an extended inter...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
How humans achieve long-term goals in an uncertain environment, via repeated trials and noisy observ...
How humans achieve long-term goals in an uncertain environment, via repeated trials and noisy observ...
This chapter discusses decision making under uncertainty. More specifically, it offers an overview o...
We study the framework of optimal decision making under uncertainty where the agents do not know the...
This chapter presents an overview of simulation-based techniques useful for solving Markov decision ...
This thesis addresses the dilemma between exploration and exploitation as it is faced by reinforceme...
Research on the implications of learning-by-doing has typically been restricted to specifications of...
In this work, we address risk-averse Bayes-adaptive reinforcement learning. We pose the problem of o...
The exploration/exploitation dilemma is a fundamental but often computationally intractable problem ...
In this work we consider probabilistic approaches to sequential decision making. The ultimate goal i...
We consider the problem of "optimal learning" for Markov decision processes with uncertain...
Markov Decision Processes are a mathematical framework widely used for stochastic optimization and c...
We consider the dilemma of taking sequential action within a nebulous and costly stochastic system. ...
Sequential decision making is a fundamental task faced by any intelligent agent in an extended inter...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
How humans achieve long-term goals in an uncertain environment, via repeated trials and noisy observ...
How humans achieve long-term goals in an uncertain environment, via repeated trials and noisy observ...
This chapter discusses decision making under uncertainty. More specifically, it offers an overview o...
We study the framework of optimal decision making under uncertainty where the agents do not know the...
This chapter presents an overview of simulation-based techniques useful for solving Markov decision ...
This thesis addresses the dilemma between exploration and exploitation as it is faced by reinforceme...
Research on the implications of learning-by-doing has typically been restricted to specifications of...
In this work, we address risk-averse Bayes-adaptive reinforcement learning. We pose the problem of o...
The exploration/exploitation dilemma is a fundamental but often computationally intractable problem ...