We consider the problem of sequentially choosing between a set of unbiased Monte Carlo estimators to minimize the mean-squared-error (MSE) of a final combined estimate. By reducing this task to a stochastic multi-armed bandit problem, we show that well developed allocation strategies can be used to achieve an MSE that approaches that of the best estimator chosen in retrospect. We then extend these developments to a scenario where alternative estimators have different, possibly stochastic costs. The outcome is a new set of adaptive Monte Carlo strategies that provide stronger guarantees than previous approaches while offering practical advantages. 1
In experiments, researchers commonly allocate subjects randomly and equally to the different treatme...
Different allocation strategies can be found in the literature to deal with the multi-armed bandit p...
24 pages, 1 figureThis paper focuses on the study of an original combination of the Multilevel Monte...
We consider the problem of sequentially choos-ing between a set of unbiased Monte Carlo estimators t...
This document presents in a unified way different results about the optimal solution of several mult...
International audienceThis paper considers the problem of maximizing an expectation function over a ...
Multi-armed bandit problem is an important optimization game that requires an exploration-exploitati...
We study a multi-armed bandit problem in a setting where covariates are available. We take a nonpara...
This thesis is dedicated to the study of resource allocation problems in uncertain environments, whe...
The main topics adressed in this thesis lie in the general domain of sequential learning, and in par...
This paper studies an important sequential decision making problem known as the multi-armed stochast...
In this thesis, we study strategies for sequential resource allocation, under the so-called stochast...
The stochastic multi-armed bandit problem is an important model for studying the exploration-exploit...
International audienceWe consider the problem of finding the best arm in a stochastic multi-armed ba...
Linking online planning for MDPs with their special case of stochastic multi-armed bandit problems, ...
In experiments, researchers commonly allocate subjects randomly and equally to the different treatme...
Different allocation strategies can be found in the literature to deal with the multi-armed bandit p...
24 pages, 1 figureThis paper focuses on the study of an original combination of the Multilevel Monte...
We consider the problem of sequentially choos-ing between a set of unbiased Monte Carlo estimators t...
This document presents in a unified way different results about the optimal solution of several mult...
International audienceThis paper considers the problem of maximizing an expectation function over a ...
Multi-armed bandit problem is an important optimization game that requires an exploration-exploitati...
We study a multi-armed bandit problem in a setting where covariates are available. We take a nonpara...
This thesis is dedicated to the study of resource allocation problems in uncertain environments, whe...
The main topics adressed in this thesis lie in the general domain of sequential learning, and in par...
This paper studies an important sequential decision making problem known as the multi-armed stochast...
In this thesis, we study strategies for sequential resource allocation, under the so-called stochast...
The stochastic multi-armed bandit problem is an important model for studying the exploration-exploit...
International audienceWe consider the problem of finding the best arm in a stochastic multi-armed ba...
Linking online planning for MDPs with their special case of stochastic multi-armed bandit problems, ...
In experiments, researchers commonly allocate subjects randomly and equally to the different treatme...
Different allocation strategies can be found in the literature to deal with the multi-armed bandit p...
24 pages, 1 figureThis paper focuses on the study of an original combination of the Multilevel Monte...