The study of online convex optimization in the bandit setting was initiated by Klein-berg (2004) and Flaxman et al. (2005). Such a setting models a decision maker that has to make decisions in the face of adversari-ally chosen convex loss functions. Moreover, the only information the decision maker re-ceives are the losses. The identities of the loss functions themselves are not revealed. In this setting, we reduce the gap between the best known lower and upper bounds for the class of smooth convex functions, i.e. convex functions with a Lipschitz continuous gradi-ent. Building upon existing work on self-concordant regularizers and one-point gradi-ent estimation, we give the first algorithm whose expected regret is O(T 2/3), ignoring consta...
The framework of online learning with memory naturally captures learning problems with temporal effe...
We study the rates of growth of the regret in online convex optimization. First, we show that a simp...
In an online convex optimization problem a decision-maker makes a sequence of decisions, i.e., choos...
Bandit convex optimization is a special case of online convex optimization with partial information....
Consider the online convex optimization problem, in which a player has to choose ac-tions iterativel...
We consider online convex optimizations in the bandit setting. The decision maker does not know the ...
Stochastic and adversarial data are two widely studied settings in online learning. But many optimiz...
We aim to design universal algorithms for online convex optimization, which can handle multiple comm...
We aim to design universal algorithms for online convex optimization, which can handle multiple comm...
Online convex optimization (OCO) is a powerful algorithmic framework that has extensive applications...
Some of the most compelling applications of online convex optimization, includ-ing online prediction...
Some of the most compelling applications of online convex optimization, includ-ing online prediction...
Some of the most compelling applications of online convex optimization, includ-ing online prediction...
We address online linear optimization problems when the possible actions of the decision maker are r...
We demonstrate a modification of the algorithm of Dani et al for the online linear optimization prob...
The framework of online learning with memory naturally captures learning problems with temporal effe...
We study the rates of growth of the regret in online convex optimization. First, we show that a simp...
In an online convex optimization problem a decision-maker makes a sequence of decisions, i.e., choos...
Bandit convex optimization is a special case of online convex optimization with partial information....
Consider the online convex optimization problem, in which a player has to choose ac-tions iterativel...
We consider online convex optimizations in the bandit setting. The decision maker does not know the ...
Stochastic and adversarial data are two widely studied settings in online learning. But many optimiz...
We aim to design universal algorithms for online convex optimization, which can handle multiple comm...
We aim to design universal algorithms for online convex optimization, which can handle multiple comm...
Online convex optimization (OCO) is a powerful algorithmic framework that has extensive applications...
Some of the most compelling applications of online convex optimization, includ-ing online prediction...
Some of the most compelling applications of online convex optimization, includ-ing online prediction...
Some of the most compelling applications of online convex optimization, includ-ing online prediction...
We address online linear optimization problems when the possible actions of the decision maker are r...
We demonstrate a modification of the algorithm of Dani et al for the online linear optimization prob...
The framework of online learning with memory naturally captures learning problems with temporal effe...
We study the rates of growth of the regret in online convex optimization. First, we show that a simp...
In an online convex optimization problem a decision-maker makes a sequence of decisions, i.e., choos...