In the context of tree-search stochastic planning algorithms where a generative model is available, we consider on-line planning algorithms building trees in order to recommend an action. We investigate the question of avoiding re-planning in subsequent decision steps by directly using sub-trees as action recommender. Firstly, we propose a method for open loop control via a new algorithm taking the decision of re-planning or not at each time step based on an analysis of the statistics of the sub-tree. Secondly, we show that the probability of selecting a suboptimal action at any depth of the tree can be upper bounded and converges towards zero. Moreover, this upper bound decays in a logarithmic way between subsequent depths. This leads to a...
Simulating complex industrial manipulation tasks (e.g., assembly, disassembly and maintenance tasks)...
Time is a crucial variable in planning and often requires special attention since it introduces a sp...
This work tackles the problem of robust zero-shot planning in non-stationary stochastic environments...
National audienceIn the context of tree-search stochastic planning algorithms where a generative mod...
Adaptive regularized framework using cubics has emerged as an alternative to line-search and trust-r...
In the field of sequential decision making and reinforcement learning, it has been observed that goo...
In many engineering optimization problems, the number of function evaluations is often very limited ...
A multitude of planning and scheduling applications have to face constrained time deadlines while pr...
This paper presents a new integrated procedure to tune a control law for overactuated mechanical sys...
his paper addresses the resolution of combinatorial optimization problems presenting some kind of re...
The training of autonomous agents often requires expensive and unsafe trial-and-error interactions w...
In the context of time-dependent problems of planning under uncertainty, most of the problem's compl...
This paper tackles a problem of UAV safe path planning in an urban environment where the onboard sen...
In order to handle constrained optimization problems with a large number of design variables, a new ...
Key words to describe the work: Evolutionary computing, Artificial Intelligence, Free Search. Key R...
Simulating complex industrial manipulation tasks (e.g., assembly, disassembly and maintenance tasks)...
Time is a crucial variable in planning and often requires special attention since it introduces a sp...
This work tackles the problem of robust zero-shot planning in non-stationary stochastic environments...
National audienceIn the context of tree-search stochastic planning algorithms where a generative mod...
Adaptive regularized framework using cubics has emerged as an alternative to line-search and trust-r...
In the field of sequential decision making and reinforcement learning, it has been observed that goo...
In many engineering optimization problems, the number of function evaluations is often very limited ...
A multitude of planning and scheduling applications have to face constrained time deadlines while pr...
This paper presents a new integrated procedure to tune a control law for overactuated mechanical sys...
his paper addresses the resolution of combinatorial optimization problems presenting some kind of re...
The training of autonomous agents often requires expensive and unsafe trial-and-error interactions w...
In the context of time-dependent problems of planning under uncertainty, most of the problem's compl...
This paper tackles a problem of UAV safe path planning in an urban environment where the onboard sen...
In order to handle constrained optimization problems with a large number of design variables, a new ...
Key words to describe the work: Evolutionary computing, Artificial Intelligence, Free Search. Key R...
Simulating complex industrial manipulation tasks (e.g., assembly, disassembly and maintenance tasks)...
Time is a crucial variable in planning and often requires special attention since it introduces a sp...
This work tackles the problem of robust zero-shot planning in non-stationary stochastic environments...