In this paper, we investigate the power of online learning in stochastic network optimization with unknown system statistics a priori. We are interested in understanding how information and learning can be efficiently incorporated into system control techniques, and what are the fundamental benefits of doing so. We propose two Online Learning-Aided Control techniques, OLAC and OLAC2, that explicitly utilize the past system information in current system control via a learning procedure called dual learning. We prove strong performance guarantees of the proposed algorithms: OLAC and OLAC2 achieve the near-optimal [O(), O([log(1/)]2)] utility-delay tradeoff and OLAC2 possesses an O(−2/3) con-vergence time. OLAC and OLAC2 are probably the first...
We consider an online learning scenario in which the learner can make predictions on the basis of a ...
Optimal control theory and machine learning techniques are combined to formulate and solve in closed...
We develop an online gradient algorithm for optimizing the performance of product-form networks thro...
This paper considers online stochastic scheduling problems where time constraints severely limit th...
2012-11-26The formulations and theories of multi-armed bandit (MAB) problems provide fundamental too...
The principal characteristic of stochastic adaptive optimization problems is the uncertainty in the ...
In this paper, we combine optimal control theory and machine learning techniques to propose and solv...
This paper aims to produce an effective online scheduling technique, where a base station (BS) sched...
This paper proposes a novel algorithm for solving discrete online learning prob-lems under stochasti...
Abstract—Combinatorial network optimization algorithms that compute optimal structures taking into a...
We consider the fundamental problem of prediction with expert advice where the experts are "optimiza...
Recently a line of work has shown the applicability of tools from online optimization for control, l...
Most work on sequential learning assumes a fixed set of actions that are available all the time. How...
Optimal control theory and machine learning techniques are combined to propose and solve in closed f...
We consider online learning when the time hori-zon is unknown. We apply a minimax analysis, beginnin...
We consider an online learning scenario in which the learner can make predictions on the basis of a ...
Optimal control theory and machine learning techniques are combined to formulate and solve in closed...
We develop an online gradient algorithm for optimizing the performance of product-form networks thro...
This paper considers online stochastic scheduling problems where time constraints severely limit th...
2012-11-26The formulations and theories of multi-armed bandit (MAB) problems provide fundamental too...
The principal characteristic of stochastic adaptive optimization problems is the uncertainty in the ...
In this paper, we combine optimal control theory and machine learning techniques to propose and solv...
This paper aims to produce an effective online scheduling technique, where a base station (BS) sched...
This paper proposes a novel algorithm for solving discrete online learning prob-lems under stochasti...
Abstract—Combinatorial network optimization algorithms that compute optimal structures taking into a...
We consider the fundamental problem of prediction with expert advice where the experts are "optimiza...
Recently a line of work has shown the applicability of tools from online optimization for control, l...
Most work on sequential learning assumes a fixed set of actions that are available all the time. How...
Optimal control theory and machine learning techniques are combined to propose and solve in closed f...
We consider online learning when the time hori-zon is unknown. We apply a minimax analysis, beginnin...
We consider an online learning scenario in which the learner can make predictions on the basis of a ...
Optimal control theory and machine learning techniques are combined to formulate and solve in closed...
We develop an online gradient algorithm for optimizing the performance of product-form networks thro...