Abstract. We attend to the classic setting where an observer needs to inform a tracker about an arbitrary time varying function f: N0 → Z. This is an optimization problem, where both wrong values at the tracker and sending updates entail a certain cost. We consider an online variant of this problem, i.e., at time t, the observer only knows f(t′) for all t ′ ≤ t. In this paper, we generalize existing cost models (with an emphasis on concave and convex penalties) and present two online algorithms. Our analysis shows that these algorithms perform well in a large class of models, and are even optimal in some settings.
We address online linear optimization problems when the possible actions of the decision maker are r...
In this paper we consider online learning in fi-nite Markov decision processes (MDPs) with changing ...
This work addresses decentralized online optimization in nonstationary environments. A network of ag...
We propose and study a new class of online problems, which we call online tracking. Suppose an obser...
We propose and study a new class of online problems, which we call online tracking. Suppose an obser...
In online tracking, an observer S receives a sequence of values, one per time instance, from a data ...
In this paper we propose a model-based approach to the design of online optimization algorithms, wit...
In the online linear optimization problem, a learner must choose, in each round, a decision from a s...
In the online linear optimization problem, a learner must choose, in each round, a decision from a s...
We consider online convex optimizations in the bandit setting. The decision maker does not know the ...
Bandit convex optimization is a special case of online convex optimization with partial information....
The framework of online learning with memory naturally captures learning problems with temporal effe...
We study how to adapt to smoothly-varying (‘easy’) environments in well-known online learning proble...
We study the rates of growth of the regret in online convex optimization. First, we show that a simp...
Tracking time-varying sparse signals is a recent problem with widespread applications. Techniques de...
We address online linear optimization problems when the possible actions of the decision maker are r...
In this paper we consider online learning in fi-nite Markov decision processes (MDPs) with changing ...
This work addresses decentralized online optimization in nonstationary environments. A network of ag...
We propose and study a new class of online problems, which we call online tracking. Suppose an obser...
We propose and study a new class of online problems, which we call online tracking. Suppose an obser...
In online tracking, an observer S receives a sequence of values, one per time instance, from a data ...
In this paper we propose a model-based approach to the design of online optimization algorithms, wit...
In the online linear optimization problem, a learner must choose, in each round, a decision from a s...
In the online linear optimization problem, a learner must choose, in each round, a decision from a s...
We consider online convex optimizations in the bandit setting. The decision maker does not know the ...
Bandit convex optimization is a special case of online convex optimization with partial information....
The framework of online learning with memory naturally captures learning problems with temporal effe...
We study how to adapt to smoothly-varying (‘easy’) environments in well-known online learning proble...
We study the rates of growth of the regret in online convex optimization. First, we show that a simp...
Tracking time-varying sparse signals is a recent problem with widespread applications. Techniques de...
We address online linear optimization problems when the possible actions of the decision maker are r...
In this paper we consider online learning in fi-nite Markov decision processes (MDPs) with changing ...
This work addresses decentralized online optimization in nonstationary environments. A network of ag...