In black-box optimization problems, we aim to maximize an unknown objective function, where the function is only accessible through feedbacks of an evaluation or simulation oracle. In real-life, the feedbacks of such oracles are often noisy and available after some unknown delay that may depend on the computation time of the oracle. Additionally, if the exact evaluations are expensive but coarse approximations are available at a lower cost, the feedbacks can have multi-fidelity. In order to address this problem, we propose a generic extension of hierarchical optimistic tree search (HOO), called ProCrastinated Tree Search (PCTS), that flexibly accommodates a delay and noise-tolerant bandit algorithm. We provide a generic proof technique to q...
We consider online planning in Markov decision processes (MDPs). In online planning, the agent focus...
The quality of solution provided by a search heuristic on a particular problem is by no means an abs...
International audience``Simple regret'' algorithms are designed for noisy optimization in unstructur...
International audienceIn black-box optimization problems, we aim to maximize an unknown objective fu...
Thesis: Ph. D., Massachusetts Institute of Technology, Sloan School of Management, Operations Resear...
Bandit based methods for tree search have recently gained popularity when applied to huge trees, e.g...
130 pagesThis work covers several aspects of the optimism in the face of uncertainty principle appli...
Black-box optimization (BBO) problems occur frequently in many engineering and scientific discipline...
Bandit based methods for tree search have recently gained popularity when applied to huge trees, e.g...
We address online linear optimization problems when the possible actions of the decision maker are r...
International audienceThe performance measure of an algorithm is a crucial part of its analysis. The...
International audienceWe study the problem of black-box optimization of a function $f$ of any dimens...
Kernel-based bandit is an extensively studied black-box optimization problem, in which the objective...
International audienceThe black box complexity of noisy-optimization is a great research area, with ...
Regret minimization is important in both the Multi-Armed Bandit problem and Monte-Carlo Tree Search ...
We consider online planning in Markov decision processes (MDPs). In online planning, the agent focus...
The quality of solution provided by a search heuristic on a particular problem is by no means an abs...
International audience``Simple regret'' algorithms are designed for noisy optimization in unstructur...
International audienceIn black-box optimization problems, we aim to maximize an unknown objective fu...
Thesis: Ph. D., Massachusetts Institute of Technology, Sloan School of Management, Operations Resear...
Bandit based methods for tree search have recently gained popularity when applied to huge trees, e.g...
130 pagesThis work covers several aspects of the optimism in the face of uncertainty principle appli...
Black-box optimization (BBO) problems occur frequently in many engineering and scientific discipline...
Bandit based methods for tree search have recently gained popularity when applied to huge trees, e.g...
We address online linear optimization problems when the possible actions of the decision maker are r...
International audienceThe performance measure of an algorithm is a crucial part of its analysis. The...
International audienceWe study the problem of black-box optimization of a function $f$ of any dimens...
Kernel-based bandit is an extensively studied black-box optimization problem, in which the objective...
International audienceThe black box complexity of noisy-optimization is a great research area, with ...
Regret minimization is important in both the Multi-Armed Bandit problem and Monte-Carlo Tree Search ...
We consider online planning in Markov decision processes (MDPs). In online planning, the agent focus...
The quality of solution provided by a search heuristic on a particular problem is by no means an abs...
International audience``Simple regret'' algorithms are designed for noisy optimization in unstructur...