Thesis: Ph. D., Massachusetts Institute of Technology, Sloan School of Management, Operations Research Center, 2017.This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.Cataloged from student-submitted PDF version of thesis.Includes bibliographical references (pages 159-166).This thesis is concerned with the design and analysis of new algorithms for sequential optimization problems with limited feedback on the outcomes of alternatives when the environment is not perfectly known in advance and may react to past decisions. Depending on the setting, we take either a worst-case approach, which protects against a fully adversarial environment, or a hinds...
In this thesis, we study the role of adaptivity in decision-making problems under uncertainty. The f...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
In the online linear optimization problem, a learner must choose, in each round, a decision from a s...
We address online linear optimization problems when the possible actions of the decision maker are r...
We consider online convex optimizations in the bandit setting. The decision maker does not know the ...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
We demonstrate a modification of the algorithm of Dani et al for the online linear optimization prob...
International audienceWe study a class of online convex optimization problems with long-term budget ...
We study the regret of optimal strategies for online convex optimization games. Using von Neumann's ...
This paper addresses the problem of minimizing a convex, Lipschitz function f over a convex, compact...
Multi-Armed Bandits (MAB) constitute the most fundamental model for sequential decision making probl...
The minmax regret problem for combinatorial optimization under uncertainty can be viewed as a zero-s...
We address the online linear optimization problem when the actions of the forecaster are represented...
In the classical stochastic k-armed bandit problem, in each of a sequence of rounds, a decision make...
23 pagesInternational audienceWe address the online linear optimization problem when the actions of ...
In this thesis, we study the role of adaptivity in decision-making problems under uncertainty. The f...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
In the online linear optimization problem, a learner must choose, in each round, a decision from a s...
We address online linear optimization problems when the possible actions of the decision maker are r...
We consider online convex optimizations in the bandit setting. The decision maker does not know the ...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
We demonstrate a modification of the algorithm of Dani et al for the online linear optimization prob...
International audienceWe study a class of online convex optimization problems with long-term budget ...
We study the regret of optimal strategies for online convex optimization games. Using von Neumann's ...
This paper addresses the problem of minimizing a convex, Lipschitz function f over a convex, compact...
Multi-Armed Bandits (MAB) constitute the most fundamental model for sequential decision making probl...
The minmax regret problem for combinatorial optimization under uncertainty can be viewed as a zero-s...
We address the online linear optimization problem when the actions of the forecaster are represented...
In the classical stochastic k-armed bandit problem, in each of a sequence of rounds, a decision make...
23 pagesInternational audienceWe address the online linear optimization problem when the actions of ...
In this thesis, we study the role of adaptivity in decision-making problems under uncertainty. The f...
First, we study online learning with an extended notion of regret, which is defined with respect to ...
In the online linear optimization problem, a learner must choose, in each round, a decision from a s...