This thesis studies three problems in sequential decision making across two different frameworks. The first framework we consider is online learning: for each round of a $T$ round repeated game, the learner makes a prediction, the adversary observes this prediction and reveals the true outcome, and the learner suffers some loss based on the accuracy of the prediction. The learner's aim is to minimize the regret, which is defined to be the difference between the learner's cumulative loss and the cumulative loss of the best prediction strategy in some class. We study the minimax strategy, which guarantees the lowest regret against all possible adversary strategies. In general, computing the minimax strategy is exponential in $T$; w...
We consider a class of sequential decision making problems in the presence of uncertainty, which bel...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
Reinforcement learning (RL) has gained an increasing interest in recent years, being expected to del...
We investigate the problem of cumulative regret minimization for individual sequence prediction with...
We investigate the problem of cumulative regret minimization for individual sequence prediction with...
We study the regret of optimal strategies for online convex optimization games. Using von Neumann's ...
We study the regret of optimal strategies for online convex optimization games. Using von Neumann's ...
We consider an agent interacting with an environment in a single stream of actions, observations, an...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
We consider the problem of minimizing the long term average expected regret of an agent in an online...
We address the online linear optimization problem when the actions of the forecaster are represented...
We study the problem of online learning with a notion of regret defined with respect to a set of str...
We consider a class of sequential decision making problems in the presence of uncertainty, which bel...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
Reinforcement learning (RL) has gained an increasing interest in recent years, being expected to del...
We investigate the problem of cumulative regret minimization for individual sequence prediction with...
We investigate the problem of cumulative regret minimization for individual sequence prediction with...
We study the regret of optimal strategies for online convex optimization games. Using von Neumann's ...
We study the regret of optimal strategies for online convex optimization games. Using von Neumann's ...
We consider an agent interacting with an environment in a single stream of actions, observations, an...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
We consider the problem of minimizing the long term average expected regret of an agent in an online...
We address the online linear optimization problem when the actions of the forecaster are represented...
We study the problem of online learning with a notion of regret defined with respect to a set of str...
We consider a class of sequential decision making problems in the presence of uncertainty, which bel...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...