We address the problem of finding the maximizer of a nonlinear smooth function, that can only be evaluated point-wise, subject to constraints on the number of permitted function evaluations. This problem is also known as fixed-budget best arm identification in the multi-armed bandit literature. We introduce a Bayesian approach for this problem and show that it empirically outperforms both the existing frequentist counterpart and other Bayesian optimization methods. The Bayesian approach places emphasis on detailed modelling, including the modelling of correlations among the arms. As a result, it can perform well in situations where the number of arms is much larger than the number of allowed function evaluation, whereas the frequentist coun...
This thesis lies in the fields of artificial intelligence, sequential statistics and optimization. W...
We consider fixed-budget best arm identification in two-armed bandit problems. One of the longstandi...
This paper investigates the best arm identification (BAI) problem in stochastic multi-armed bandits ...
We address the problem of finding the maximizer of a nonlinear smooth function, that can only be eva...
Many real-world functions are defined over both categorical and category-specific continuous variabl...
International audienceThe stochastic multi-armed bandit model is a simple abstraction that has prove...
International audienceTop Two algorithms arose as an adaptation of Thompson sampling to best arm ide...
Best-arm identification (BAI) in a fixed-budget setting is a bandit problem where the learning agent...
The stochastic multi-armed bandit model is a simple abstraction that has proven useful in many diffe...
We consider a stochastic bandit problem with a possibly infinite number of arms. We write p∗ for the...
This document presents in a unified way different results about the optimal solution of several mult...
We consider the top-k arm identification problem for multi-armed bandits with rewards belonging to a...
International audienceWe study the best-arm identification problem in linear bandit, where the rewar...
In this thesis, we study strategies for sequential resource allocation, under the so-called stochast...
International audienceWe consider the problem of best arm identification in the multi-armed bandit m...
This thesis lies in the fields of artificial intelligence, sequential statistics and optimization. W...
We consider fixed-budget best arm identification in two-armed bandit problems. One of the longstandi...
This paper investigates the best arm identification (BAI) problem in stochastic multi-armed bandits ...
We address the problem of finding the maximizer of a nonlinear smooth function, that can only be eva...
Many real-world functions are defined over both categorical and category-specific continuous variabl...
International audienceThe stochastic multi-armed bandit model is a simple abstraction that has prove...
International audienceTop Two algorithms arose as an adaptation of Thompson sampling to best arm ide...
Best-arm identification (BAI) in a fixed-budget setting is a bandit problem where the learning agent...
The stochastic multi-armed bandit model is a simple abstraction that has proven useful in many diffe...
We consider a stochastic bandit problem with a possibly infinite number of arms. We write p∗ for the...
This document presents in a unified way different results about the optimal solution of several mult...
We consider the top-k arm identification problem for multi-armed bandits with rewards belonging to a...
International audienceWe study the best-arm identification problem in linear bandit, where the rewar...
In this thesis, we study strategies for sequential resource allocation, under the so-called stochast...
International audienceWe consider the problem of best arm identification in the multi-armed bandit m...
This thesis lies in the fields of artificial intelligence, sequential statistics and optimization. W...
We consider fixed-budget best arm identification in two-armed bandit problems. One of the longstandi...
This paper investigates the best arm identification (BAI) problem in stochastic multi-armed bandits ...