184 pagesUtilizing structure in mathematical modeling is instrumental for better model de- sign, creation, and solution. In this dissertation, we explore smoothness-based structure for problems, specifically involving decisions in uncertain environments, in different stages of the modeling process. What we offer in each case is a new framework to approach the question, as well as improvements in either algorithmic effectiveness or theoretical guarantees. First, we consider repeated decision problems under stochastic environments as modeled by Markov Decision Processes (MDPs), where one aims to minimize in- finite horizon cumulative discounted value. The standard Dynamic Programming (DP) approaches suffer from the curse-of-dimensionality, wh...
This dissertation considers a particular aspect of sequential decision making under uncertainty in w...
We consider the problem of minimizing the long term average expected regret of an agent in an online...
In this thesis, we study the related problems of reinforcement learning and optimal adaptive control...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
The problem of making optimal decisions in uncertain conditions is central to Artificial Intelligenc...
<p>This dissertation describes sequential decision making problems in non-stationary environments. O...
This chapter presents an overview of simulation-based techniques useful for solving Markov decision ...
To quickly achieve good performance, reinforcement-learning algorithms for acting in large continuou...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
Sequentially making-decision abounds in real-world problems ranging from robots needing to interact ...
The problem of making decisions is ubiquitous in life. This problem becomes even more complex when t...
The impetus for exploration in reinforcement learning (RL) is decreasing uncertainty about the envir...
This paper presents a novel algorithm for learning in a class of stochastic Markov decision process...
We consider a learning problem where the decision maker interacts with a standard Markov decision pr...
Markov decision processes (MDP) is a standard modeling tool for sequential decision making in a dyna...
This dissertation considers a particular aspect of sequential decision making under uncertainty in w...
We consider the problem of minimizing the long term average expected regret of an agent in an online...
In this thesis, we study the related problems of reinforcement learning and optimal adaptive control...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
The problem of making optimal decisions in uncertain conditions is central to Artificial Intelligenc...
<p>This dissertation describes sequential decision making problems in non-stationary environments. O...
This chapter presents an overview of simulation-based techniques useful for solving Markov decision ...
To quickly achieve good performance, reinforcement-learning algorithms for acting in large continuou...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
Sequentially making-decision abounds in real-world problems ranging from robots needing to interact ...
The problem of making decisions is ubiquitous in life. This problem becomes even more complex when t...
The impetus for exploration in reinforcement learning (RL) is decreasing uncertainty about the envir...
This paper presents a novel algorithm for learning in a class of stochastic Markov decision process...
We consider a learning problem where the decision maker interacts with a standard Markov decision pr...
Markov decision processes (MDP) is a standard modeling tool for sequential decision making in a dyna...
This dissertation considers a particular aspect of sequential decision making under uncertainty in w...
We consider the problem of minimizing the long term average expected regret of an agent in an online...
In this thesis, we study the related problems of reinforcement learning and optimal adaptive control...