We study the multi-armed bandit problem with multiple plays and a budget constraint, in both the stochastic and the adversarial setting. At each round, exactly K out of N possible arms have to be played (with 1 ≤ K ≤ N). In addition to observing the individual rewards for each arm played, the player also learns a vector of costs which has to be covered with an a priori defined budget B. The game ends when the sum of the costs incurred by the played arms exceeds the remaining budget. First, we analyze the stochastic case, in which each arm is assumed to have an underlying cost and reward distribution with support [c_min, 1] and [0, 1], respectively. We derive an Upper Confidence Bound (UCB) algorithm which achie...
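To make the setting above concrete, the following is a minimal simulation sketch in Python. It is not the algorithm analyzed in the abstract: the reward-UCB over cost-LCB index, the clipped-Gaussian cost model, and the function budgeted_ucb_simulation with its parameters are all assumptions introduced only to illustrate the interaction (play K of N arms per round, observe per-arm rewards and costs, stop once the budget B is exhausted).

```python
import numpy as np

def budgeted_ucb_simulation(reward_means, cost_means, budget, K, c_min=0.05, seed=0):
    """Illustrative sketch (not the paper's exact algorithm): a UCB-style index
    policy for multiple plays under a budget. Each round exactly K of the N arms
    are played, per-arm rewards and costs are observed, and the game ends once
    the total cost spent exceeds the budget B. Rewards lie in [0, 1]; costs in
    [c_min, 1]. The index and the toy cost model are assumptions made so the
    example runs end to end."""
    rng = np.random.default_rng(seed)
    reward_means = np.asarray(reward_means, dtype=float)
    cost_means = np.asarray(cost_means, dtype=float)
    N = len(reward_means)
    pulls = np.zeros(N)
    reward_sum = np.zeros(N)
    cost_sum = np.zeros(N)
    spent, total_reward, t = 0.0, 0.0, 0

    while True:
        t += 1
        if np.any(pulls == 0):
            # Initialization: favour arms that have not been played yet.
            index = -pulls
        else:
            conf = np.sqrt(2.0 * np.log(t) / pulls)
            reward_ucb = np.minimum(reward_sum / pulls + conf, 1.0)
            cost_lcb = np.maximum(cost_sum / pulls - conf, c_min)
            index = reward_ucb / cost_lcb
        chosen = np.argsort(index)[-K:]  # play the K arms with the highest index

        rewards = rng.binomial(1, reward_means[chosen])                    # rewards in {0, 1}
        costs = np.clip(rng.normal(cost_means[chosen], 0.1), c_min, 1.0)   # costs in [c_min, 1]

        pulls[chosen] += 1
        reward_sum[chosen] += rewards
        cost_sum[chosen] += costs
        spent += costs.sum()
        if spent > budget:
            # Game over: this round's costs exceed the remaining budget.
            break
        total_reward += rewards.sum()
    return total_reward

# Example: N = 5 arms, K = 2 plays per round, budget B = 100.
print(budgeted_ucb_simulation(
    reward_means=[0.9, 0.7, 0.5, 0.3, 0.2],
    cost_means=[0.6, 0.3, 0.4, 0.5, 0.2],
    budget=100.0, K=2))
```

The ratio index reflects the usual intuition in budgeted bandits of trading off reward per unit cost; whether this matches the index used in the paper is left open here, since the abstract is truncated.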
The stochastic multi-armed bandit problem is an important model for studying the exploration-exploit...
This paper investigates stochastic and adversarial combinatorial multi-armed bandit problems. In the...
In this paper, we consider the problem of multi-armed bandits with a large, possibly infi...
We study a generalization of the multi-armed bandit problem with multiple p...
We study the infinitely many-armed bandit problem with budget constraints, where the number of arms ...
In a multi-armed bandit (MAB) problem a gambler needs to choose at each round of play one of K arms,...
We consider multi-armed bandit problems where the number of arms is larger than the possible number ...
We study multi-armed bandit problems with a budget constraint and variable costs (MAB-BV). In this...
We study a two-player stochastic multi-armed bandit (MAB) problem with different expected rewards fo...
We introduce the budget-limited multi-armed bandit (MAB), which captures situations where a learner's...
In the multi-armed bandit problem, a gambler must decide which arm of non-identical slot machines t...
We consider a multiarmed bandit problem where the expected reward of each arm is a linear function o...
We consider the stochastic multi-armed bandit (MAB) problem in a setting where a player can pay to p...