This thesis deals with online optimization of discrete performance measures in Markovian models with incomplete information. We consider a setting where a physical realization of the model is sequentially obtained over a number of periods. The information gathered to date is used in order to efficiently run the model in future days. The information is incomplete in two ways: (i) model parameters are initially unknown (the demand rates in our case), but can be estimated from the physical realizations; and (ii), the demands are censored when the system is in some boundary states. The method of Sample Average Approximation is used to solve the optimization problem. More precisely, in each period, sample paths are generated from the distributio...
Online learning is discussed from the viewpoint of Bayesian statistical inference. By replacing the ...
In this paper, we investigate the power of online learning in stochastic network optimization with u...
2012-11-26The formulations and theories of multi-armed bandit (MAB) problems provide fundamental too...
AbstractIn modern computer systems, the intermittent behaviour of infrequent, additional loads affec...
We propose and analyze two different Bayesian online algorithms for learning in discrete Hidden Mark...
© 2015 The Authors. Published by Elsevier B.V.In modern computer systems, the intermittent behaviour...
International audienceWe consider online learning in finite stochastic Markovian environments where ...
In this paper we consider online learning in fi-nite Markov decision processes (MDPs) with changing ...
This paper considers online stochastic scheduling problems where time constraints severely limit th...
<p>This dissertation describes sequential decision making problems in non-stationary environments. O...
We present and analyze three different online algorithms for learning in discrete Hidden Markov Mode...
This thesis considers the analysis and design of algorithms for the management and control of uncert...
We consider an online learning scenario in which the learner can make predictions on the basis of a ...
In this paper, we study bottleneck identification in networks via extracting minimax paths. Many rea...
Abstract—A fundamental theoretical problem in opportunistic spectrum access is the following: a sing...
Online learning is discussed from the viewpoint of Bayesian statistical inference. By replacing the ...
In this paper, we investigate the power of online learning in stochastic network optimization with u...
2012-11-26The formulations and theories of multi-armed bandit (MAB) problems provide fundamental too...
AbstractIn modern computer systems, the intermittent behaviour of infrequent, additional loads affec...
We propose and analyze two different Bayesian online algorithms for learning in discrete Hidden Mark...
© 2015 The Authors. Published by Elsevier B.V.In modern computer systems, the intermittent behaviour...
International audienceWe consider online learning in finite stochastic Markovian environments where ...
In this paper we consider online learning in fi-nite Markov decision processes (MDPs) with changing ...
This paper considers online stochastic scheduling problems where time constraints severely limit th...
<p>This dissertation describes sequential decision making problems in non-stationary environments. O...
We present and analyze three different online algorithms for learning in discrete Hidden Markov Mode...
This thesis considers the analysis and design of algorithms for the management and control of uncert...
We consider an online learning scenario in which the learner can make predictions on the basis of a ...
In this paper, we study bottleneck identification in networks via extracting minimax paths. Many rea...
Abstract—A fundamental theoretical problem in opportunistic spectrum access is the following: a sing...
Online learning is discussed from the viewpoint of Bayesian statistical inference. By replacing the ...
In this paper, we investigate the power of online learning in stochastic network optimization with u...
2012-11-26The formulations and theories of multi-armed bandit (MAB) problems provide fundamental too...