Restless multi-armed bandits are often used to model budget-constrained resource allocation tasks where receipt of the resource is associated with an increased probability of a favorable state transition. Prior work assumes that individual arms only benefit if they receive the resource directly. However, many allocation tasks occur within communities and can be characterized by positive externalities that allow arms to derive partial benefit when their neighbor(s) receive the resource. We thus introduce networked restless bandits, a novel multi-armed bandit setting in which arms are both restless and embedded within a directed graph. We then present Greta, a graph-aware, Whittle index-based heuristic algorithm that can be used to efficientl...
We consider the restless multi-armed bandit (RMAB) problem with unknown dynamics in which a...
The multi-armed bandit(MAB) problem is a simple yet powerful framework that has been extensively stu...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...
Restless multi-armed bandits are often used to model budget-constrained resource allocation tasks wh...
We study a finite-horizon restless multi-armed bandit problem with multiple actions, dubbed as R(MA)...
Restless multi-armed bandits (RMABs) are an important model to optimize allocation of limited resour...
International audienceWe develop a unifying framework to obtain efficient index policies for restles...
The problem of rested and restless multi-armed bandits with constrained availability (RMAB-CA) of ar...
We consider the multi-armed restless bandit problem (RMABP) with an infinite horizon average cost ob...
Abstract—The multi-armed bandit problem and one of its most interesting extensions, the restless ban...
155 pagesWe consider multi-action restless bandits with multiple resource constraints, also referred...
International audienceThis paper presents a new reinforcement learning (RL) algorithm called Bellman...
The class of restless bandits as proposed by Whittle (1988) have long been known to be intractable. ...
We study a resource allocation problem with varying requests and with resources of limited capacity ...
Abstract—We consider two variants of the standard multi-armed bandit problem, namely, the multi-arme...
We consider the restless multi-armed bandit (RMAB) problem with unknown dynamics in which a...
The multi-armed bandit(MAB) problem is a simple yet powerful framework that has been extensively stu...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...
Restless multi-armed bandits are often used to model budget-constrained resource allocation tasks wh...
We study a finite-horizon restless multi-armed bandit problem with multiple actions, dubbed as R(MA)...
Restless multi-armed bandits (RMABs) are an important model to optimize allocation of limited resour...
International audienceWe develop a unifying framework to obtain efficient index policies for restles...
The problem of rested and restless multi-armed bandits with constrained availability (RMAB-CA) of ar...
We consider the multi-armed restless bandit problem (RMABP) with an infinite horizon average cost ob...
Abstract—The multi-armed bandit problem and one of its most interesting extensions, the restless ban...
155 pagesWe consider multi-action restless bandits with multiple resource constraints, also referred...
International audienceThis paper presents a new reinforcement learning (RL) algorithm called Bellman...
The class of restless bandits as proposed by Whittle (1988) have long been known to be intractable. ...
We study a resource allocation problem with varying requests and with resources of limited capacity ...
Abstract—We consider two variants of the standard multi-armed bandit problem, namely, the multi-arme...
We consider the restless multi-armed bandit (RMAB) problem with unknown dynamics in which a...
The multi-armed bandit(MAB) problem is a simple yet powerful framework that has been extensively stu...
Multi-armed bandit problems formalize the exploration-exploitation trade-offs arising in several ind...