Restless multi-armed bandits (RMABs) are an important model to optimize allocation of limited resources in sequential decision-making settings. Typical RMABs assume the budget --- the number of arms pulled --- to be fixed for each step in the planning horizon. However, for realistic real-world planning, resources are not necessarily limited at each planning step; we may be able to distribute surplus resources in one round to an earlier or later round. In real-world planning settings, this flexibility in budget is often constrained to within a subset of consecutive planning steps, e.g., weekly planning of a monthly budget. In this paper we define a general class of RMABs with flexible budget, which we term F-RMABs, and provide an algorithm t...
Abstract—We consider a task assignment problem for a fleet of UAVs in a surveillance/search mission....
We propose a mathematical programming approach for the classical PSPACE - hard problem of n restless...
We consider the multi-armed restless bandit problem (RMABP) with an infinite horizon average cost ob...
Abstract—The multi-armed bandit problem and one of its most interesting extensions, the restless ban...
We introduce the budget-limited multi-armed bandit (MAB), which captures situations where a learners...
We introduce the budget–limited multi–armed bandit (MAB), which captures situations whe...
We study a resource allocation problem with varying requests and with resources of limited capacity ...
In budget-limited multi-armed bandit (MAB) problems, the learners actions are costly and constrained...
Restless multi-armed bandits are often used to model budget-constrained resource allocation tasks wh...
155 pagesWe consider multi-action restless bandits with multiple resource constraints, also referred...
We introduce robustness in \textit{restless multi-armed bandits} (RMABs), a popular model for constr...
Abstract—We consider two variants of the standard multi-armed bandit problem, namely, the multi-arme...
We study the multi-armed bandit problems with budget constraint and variable costs (MAB-BV). In this...
We study the infinitely many-armed bandit problem with budget constraints, where the number of arms ...
This paper develops a framework based on convex optimization and economic ideas to formulate and sol...
Abstract—We consider a task assignment problem for a fleet of UAVs in a surveillance/search mission....
We propose a mathematical programming approach for the classical PSPACE - hard problem of n restless...
We consider the multi-armed restless bandit problem (RMABP) with an infinite horizon average cost ob...
Abstract—The multi-armed bandit problem and one of its most interesting extensions, the restless ban...
We introduce the budget-limited multi-armed bandit (MAB), which captures situations where a learners...
We introduce the budget–limited multi–armed bandit (MAB), which captures situations whe...
We study a resource allocation problem with varying requests and with resources of limited capacity ...
In budget-limited multi-armed bandit (MAB) problems, the learners actions are costly and constrained...
Restless multi-armed bandits are often used to model budget-constrained resource allocation tasks wh...
155 pagesWe consider multi-action restless bandits with multiple resource constraints, also referred...
We introduce robustness in \textit{restless multi-armed bandits} (RMABs), a popular model for constr...
Abstract—We consider two variants of the standard multi-armed bandit problem, namely, the multi-arme...
We study the multi-armed bandit problems with budget constraint and variable costs (MAB-BV). In this...
We study the infinitely many-armed bandit problem with budget constraints, where the number of arms ...
This paper develops a framework based on convex optimization and economic ideas to formulate and sol...
Abstract—We consider a task assignment problem for a fleet of UAVs in a surveillance/search mission....
We propose a mathematical programming approach for the classical PSPACE - hard problem of n restless...
We consider the multi-armed restless bandit problem (RMABP) with an infinite horizon average cost ob...