We consider the problem of sequentially choos-ing between a set of unbiased Monte Carlo estimators to minimize the mean-squared-error (MSE) of a final combined estimate. By reduc-ing this task to a stochastic multi-armed bandit problem, we show that well developed allocation strategies can be used to achieve an MSE that ap-proaches that of the best estimator chosen in ret-rospect. We then extend these developments to a scenario where alternative estimators have dif-ferent, possibly stochastic costs. The outcome is a new set of adaptive Monte Carlo strategies that provide stronger guarantees than previous approaches while offering practical advantages. 1
Different allocation strategies can be found in the literature to deal with the multi-armed bandit p...
Consider a Bayesian sequential allocation problem that incorporates a covariate. The goal is to maxi...
We consider a multi-armed bandit problem in a setting where each arm produces a noisy reward realiza...
We consider the problem of sequentially choos-ing between a set of unbiased Monte Carlo estimators t...
This document presents in a unified way different results about the optimal solution of several mult...
International audienceThis paper considers the problem of maximizing an expectation function over a ...
Multi-armed bandit problem is an important optimization game that requires an exploration-exploitati...
We study a multi-armed bandit problem in a setting where covariates are available. We take a nonpara...
The main topics adressed in this thesis lie in the general domain of sequential learning, and in par...
This thesis is dedicated to the study of resource allocation problems in uncertain environments, whe...
The stochastic multi-armed bandit problem is an important model for studying the exploration-exploit...
In this thesis, we study strategies for sequential resource allocation, under the so-called stochast...
This paper studies an important sequential decision making problem known as the multi-armed stochast...
International audienceThe stochastic multi-armed bandit problem is a popular model of the exploratio...
Dans cette thèse, nous étudions des stratégies d’allocation séquentielle de ressources. Le modèle st...
Different allocation strategies can be found in the literature to deal with the multi-armed bandit p...
Consider a Bayesian sequential allocation problem that incorporates a covariate. The goal is to maxi...
We consider a multi-armed bandit problem in a setting where each arm produces a noisy reward realiza...
We consider the problem of sequentially choos-ing between a set of unbiased Monte Carlo estimators t...
This document presents in a unified way different results about the optimal solution of several mult...
International audienceThis paper considers the problem of maximizing an expectation function over a ...
Multi-armed bandit problem is an important optimization game that requires an exploration-exploitati...
We study a multi-armed bandit problem in a setting where covariates are available. We take a nonpara...
The main topics adressed in this thesis lie in the general domain of sequential learning, and in par...
This thesis is dedicated to the study of resource allocation problems in uncertain environments, whe...
The stochastic multi-armed bandit problem is an important model for studying the exploration-exploit...
In this thesis, we study strategies for sequential resource allocation, under the so-called stochast...
This paper studies an important sequential decision making problem known as the multi-armed stochast...
International audienceThe stochastic multi-armed bandit problem is a popular model of the exploratio...
Dans cette thèse, nous étudions des stratégies d’allocation séquentielle de ressources. Le modèle st...
Different allocation strategies can be found in the literature to deal with the multi-armed bandit p...
Consider a Bayesian sequential allocation problem that incorporates a covariate. The goal is to maxi...
We consider a multi-armed bandit problem in a setting where each arm produces a noisy reward realiza...