This paper addresses the multi-armed bandit problem with switching costs. Asawa and Teneketzis (1996) introduced an index that partly characterizes optimal policies, attaching to each bandit state a "continuation index" (its Gittins index) and a "switching index". They proposed to jointly compute both as the Gittins index of a bandit having 2n states — when the original bandit has n states — which results in an eight-fold increase in O(n^3) arithmetic operations relative to those to compute the continuation index alone. This paper presents a more efficient, decoupled computation method, which in a first stage computes the continuation index and then, in a second stage, computes the switching index an order of magnitude faster in at most n^...
We investigate the general multi-armed bandit problem with multiple servers. We determine a conditio...
Includes bibliographical references (p. 5).Supported by the ARO. DAAL03-92-G-0115 Supported by Sieme...
Whittle index is a generalization of Gittins index that provides very efficient allocation rules for...
This paper addresses the multi-armed bandit problem with switching costs. Asawa and Teneketzis (1996...
This paper addresses the multi-armed bandit problem with switching penalties including both costs an...
This paper addresses the multi-armed bandit problem with switching penalties including both costs an...
We consider the multi-armed bandit problem with penalties for switching that include setup delays an...
The Theorem of Gittins and Jones (1974) is, perhaps, the single most powerful result in the literat...
This note shows that the optimal choice of k simultaneous experiments in a stationary multi-armed ba...
The Whittle index [P. Whittle (1988). Restless bandits: Activity allocation in a changing world. J. ...
Abstract. A variant of the multi-armed bandit problem was recently introduced by Dimitriu, Tetali an...
In 1988 Whittle introduced an important but intractable class of restless bandit problems which gene...
We generalise classical multiarmed bandits to allow for the distribution of a (fixed amount of a) di...
This paper presents a new fast-pivoting algorithm that computes the n Gittins index values of an n-s...
Whittle index is a generalization of Gittins index that provides very efficient allocation rules for...
We investigate the general multi-armed bandit problem with multiple servers. We determine a conditio...
Includes bibliographical references (p. 5).Supported by the ARO. DAAL03-92-G-0115 Supported by Sieme...
Whittle index is a generalization of Gittins index that provides very efficient allocation rules for...
This paper addresses the multi-armed bandit problem with switching costs. Asawa and Teneketzis (1996...
This paper addresses the multi-armed bandit problem with switching penalties including both costs an...
This paper addresses the multi-armed bandit problem with switching penalties including both costs an...
We consider the multi-armed bandit problem with penalties for switching that include setup delays an...
The Theorem of Gittins and Jones (1974) is, perhaps, the single most powerful result in the literat...
This note shows that the optimal choice of k simultaneous experiments in a stationary multi-armed ba...
The Whittle index [P. Whittle (1988). Restless bandits: Activity allocation in a changing world. J. ...
Abstract. A variant of the multi-armed bandit problem was recently introduced by Dimitriu, Tetali an...
In 1988 Whittle introduced an important but intractable class of restless bandit problems which gene...
We generalise classical multiarmed bandits to allow for the distribution of a (fixed amount of a) di...
This paper presents a new fast-pivoting algorithm that computes the n Gittins index values of an n-s...
Whittle index is a generalization of Gittins index that provides very efficient allocation rules for...
We investigate the general multi-armed bandit problem with multiple servers. We determine a conditio...
Includes bibliographical references (p. 5).Supported by the ARO. DAAL03-92-G-0115 Supported by Sieme...
Whittle index is a generalization of Gittins index that provides very efficient allocation rules for...