[[abstract]]A bandit problem with infinitely many Bernoulli arms is considered. The parameters of Bernoulli arms are independent and identically distributed random variables from a common distribution with beta(a, b). We investigate the k-failure strategy which is a modification of Robbins's stay-with-a-winner/switch-on-a-loser strategy and three other strategies proposed recently by Berry et al. (1997, Ann. Statist., 25, 2103–2116). We show that the k-failure strategy performs poorly when b is greater than 1, and the best strategy among the k-failure strategies is the 1-failure strategy when b is less than or equal to 1. Utilizing the formulas derived by Berry et al. (1997), we obtain the asymptotic expected failure rates of these three st...
We consider a bandit problem which involves sequential sampling from two populations (arms). Each ar...
We consider a stochastic bandit problem with a possibly infinite number of arms. We write p∗ for the...
In this thesis, we study strategies for sequential resource allocation, under the so-called stochast...
[[abstract]]A bandit problem with infinitely many Bernoulli arms is considered. The parameters of Be...
[[abstract]]A bandit problem consisting of a sequence of n choices (n→∞) from a number of infinitely...
We consider a bandit problem consisting of a sequence of n choices from an infinite number of Bernou...
In this paper we investigate the multi-armed bandit problem, where each arm generates an infinite se...
In this paper we investigate the multi-armed bandit problem, where each arm generates an infinite se...
We consider an infinite-armed bandit problem with Bernoulli rewards. The mean rewards are independen...
AbstractOne of a number of Bernoulli processes is selected at each of a number of stages. A success ...
An empirical comparative study is made of a sample of action selection policies on a test suite of t...
This document presents in a unified way different results about the optimal solution of several mult...
In this paper, we study an independent Bernoulli two-armed bandit with unknown parameters ρ and λ, w...
We consider a multiarmed bandit problem where the expected reward of each arm is a linear function o...
Cette thèse s'inscrit dans les domaines de l'apprentissage statistique et de la statistique séquenti...
We consider a bandit problem which involves sequential sampling from two populations (arms). Each ar...
We consider a stochastic bandit problem with a possibly infinite number of arms. We write p∗ for the...
In this thesis, we study strategies for sequential resource allocation, under the so-called stochast...
[[abstract]]A bandit problem with infinitely many Bernoulli arms is considered. The parameters of Be...
[[abstract]]A bandit problem consisting of a sequence of n choices (n→∞) from a number of infinitely...
We consider a bandit problem consisting of a sequence of n choices from an infinite number of Bernou...
In this paper we investigate the multi-armed bandit problem, where each arm generates an infinite se...
In this paper we investigate the multi-armed bandit problem, where each arm generates an infinite se...
We consider an infinite-armed bandit problem with Bernoulli rewards. The mean rewards are independen...
AbstractOne of a number of Bernoulli processes is selected at each of a number of stages. A success ...
An empirical comparative study is made of a sample of action selection policies on a test suite of t...
This document presents in a unified way different results about the optimal solution of several mult...
In this paper, we study an independent Bernoulli two-armed bandit with unknown parameters ρ and λ, w...
We consider a multiarmed bandit problem where the expected reward of each arm is a linear function o...
Cette thèse s'inscrit dans les domaines de l'apprentissage statistique et de la statistique séquenti...
We consider a bandit problem which involves sequential sampling from two populations (arms). Each ar...
We consider a stochastic bandit problem with a possibly infinite number of arms. We write p∗ for the...
In this thesis, we study strategies for sequential resource allocation, under the so-called stochast...