Recent work on Markov Decision Processes (MDPs) covers the use of continuous variables and resources, including time. This work is usually done in a framework of bounded resources and finite temporal horizon for which a total reward criterion is often appropriate. However, most of this work considers discrete effects on continuous variables while considering continuous variables often allows for parametric (possibly continuous) quantification of actions effects. On top of that, infinite horizon MDPs often make use of discounted criterions in order to insure convergence and to account for the difference between a reward obtained now and a reward obtained later. In this paper, we build on the standard MDP framework in order to extend it to co...
The development of embedded systems according to Model-Driven Development relies on two complementar...
A multitude of planning and scheduling applications have to face constrained time deadlines while pr...
This dissertation is mainly motivated by the problem of statistical modeling via a specific point p...
Time is a crucial variable in planning and often requires special attention since it introduces a sp...
In the context of time-dependent problems of planning under uncertainty, most of the problem's compl...
This work tackles the problem of robust zero-shot planning in non-stationary stochastic environments...
In order to allow the temporal coordination of two independent communicating agents, one needs to be...
In the field of sequential decision making and reinforcement learning, it has been observed that goo...
We study a resource leveling problem with variable job duration. The considered problem includes bot...
One of the most significant characteristics of optimizing models is that the behavioral equations in...
Bandits are one of the most basic examples of decision-making with uncertainty. A Markovian restless...
Consider compound Poisson processes with negative drift and no negative jumps, which converge to som...
In this article we propose a Monte Carlo algorithm for sequential parameter learning for a stochasti...
The training of autonomous agents often requires expensive and unsafe trial-and-error interactions w...
AbstractIn this paper a Markov model for Evolutionary Multi-Agent System is recalled. The model allo...
The development of embedded systems according to Model-Driven Development relies on two complementar...
A multitude of planning and scheduling applications have to face constrained time deadlines while pr...
This dissertation is mainly motivated by the problem of statistical modeling via a specific point p...
Time is a crucial variable in planning and often requires special attention since it introduces a sp...
In the context of time-dependent problems of planning under uncertainty, most of the problem's compl...
This work tackles the problem of robust zero-shot planning in non-stationary stochastic environments...
In order to allow the temporal coordination of two independent communicating agents, one needs to be...
In the field of sequential decision making and reinforcement learning, it has been observed that goo...
We study a resource leveling problem with variable job duration. The considered problem includes bot...
One of the most significant characteristics of optimizing models is that the behavioral equations in...
Bandits are one of the most basic examples of decision-making with uncertainty. A Markovian restless...
Consider compound Poisson processes with negative drift and no negative jumps, which converge to som...
In this article we propose a Monte Carlo algorithm for sequential parameter learning for a stochasti...
The training of autonomous agents often requires expensive and unsafe trial-and-error interactions w...
AbstractIn this paper a Markov model for Evolutionary Multi-Agent System is recalled. The model allo...
The development of embedded systems according to Model-Driven Development relies on two complementar...
A multitude of planning and scheduling applications have to face constrained time deadlines while pr...
This dissertation is mainly motivated by the problem of statistical modeling via a specific point p...