We consider the multi-armed bandit problem with switching penalties that include setup delays and costs, extending the author's earlier results for the special case with no switching delays. In 1996, Asawa and Teneketzis introduced a priority index for projects with setup delays that partly characterizes optimal policies, but they gave no means of computing it. We present a fast two-stage index-computing method. The first stage computes the continuation index (which applies when the project has been set up) together with certain extra quantities, with cubic arithmetic-operation complexity in the number of project states; the second stage then computes the switching index (which applies when the project is not set up)...
We study the asymptotic optimal control of multi-class restless bandits. A restless bandit is a cont...
The multi-armed bandit problem and one of its most interesting extensions, the restless ban...
This paper addresses the multi-armed bandit problem with switching penalties including both costs an...
This paper addresses the multi-armed bandit problem with switching costs. Asawa and Teneketzis (1996...
This article belongs to the Special Issue Applied Probability. The Whittle index for restless bandits ...
The Whittle index [P. Whittle (1988). Restless bandits: Activity allocation in a changing world. J. ...
This article considers an important class of discrete time restless bandits, given by the discounted...
In 1988 Whittle introduced an important but intractable class of restless bandit problems which gene...
We investigate the general multi-armed bandit problem with multiple servers. We determine a conditio...
In this paper we study a Multi-Armed Restless Bandit Problem (MARBP) subject t...
We investigate the optimal allocation of effort to a collection of n projects. The projects are '...
We generalise classical multiarmed bandits to allow for the distribution of a (fixed amount of a) di...