This work tackles the problem of robust zero-shot planning in non-stationary stochastic environments. We study Markov Decision Processes (MDPs) evolving over time and consider Model-Based Reinforcement Learning algorithms in this setting. We make two hypotheses: 1) the environment evolves continuously with a bounded evolution rate; 2) a current model is known at each decision epoch but not its evolution. Our contribution can be presented in four points. 1) we define a specific class of MDPs that we call Non-Stationary MDPs (NSMDPs). We introduce the notion of regular evolution by making an hypothesis of Lipschitz-Continuity on the transition and reward functions w.r.t. time; 2) we consider a planning agent using the current model of the env...
[EN]In this work we present several ideas for planning under uncertainty. Our intention is to apply...
In this paper, we develop finite-sample inference procedures for stationary and nonstationary autore...
One common way of describing the tasks addressable by machine learning is to break them down into th...
Time is a crucial variable in planning and often requires special attention since it introduces a sp...
In the field of sequential decision making and reinforcement learning, it has been observed that goo...
The training of autonomous agents often requires expensive and unsafe trial-and-error interactions w...
In the context of time-dependent problems of planning under uncertainty, most of the problem's compl...
Recent work on Markov Decision Processes (MDPs) covers the use of continuous variables and resources...
AbstractIn this paper a Markov model for Evolutionary Multi-Agent System is recalled. The model allo...
Artificial Intelligence (AI) is the branch of the Computer Science field that tries to imbue intell...
This paper tackles a problem of UAV safe path planning in an urban environment where the onboard sen...
Bandits are one of the most basic examples of decision-making with uncertainty. A Markovian restless...
In order to allow the temporal coordination of two independent communicating agents, one needs to be...
Markov Decision Processes (MDPs) constitute a mathematical framework for modelling systems featuring...
General purpose intelligent learning agents cycle through (complex,non-MDP) sequences of observatio...
[EN]In this work we present several ideas for planning under uncertainty. Our intention is to apply...
In this paper, we develop finite-sample inference procedures for stationary and nonstationary autore...
One common way of describing the tasks addressable by machine learning is to break them down into th...
Time is a crucial variable in planning and often requires special attention since it introduces a sp...
In the field of sequential decision making and reinforcement learning, it has been observed that goo...
The training of autonomous agents often requires expensive and unsafe trial-and-error interactions w...
In the context of time-dependent problems of planning under uncertainty, most of the problem's compl...
Recent work on Markov Decision Processes (MDPs) covers the use of continuous variables and resources...
AbstractIn this paper a Markov model for Evolutionary Multi-Agent System is recalled. The model allo...
Artificial Intelligence (AI) is the branch of the Computer Science field that tries to imbue intell...
This paper tackles a problem of UAV safe path planning in an urban environment where the onboard sen...
Bandits are one of the most basic examples of decision-making with uncertainty. A Markovian restless...
In order to allow the temporal coordination of two independent communicating agents, one needs to be...
Markov Decision Processes (MDPs) constitute a mathematical framework for modelling systems featuring...
General purpose intelligent learning agents cycle through (complex,non-MDP) sequences of observatio...
[EN]In this work we present several ideas for planning under uncertainty. Our intention is to apply...
In this paper, we develop finite-sample inference procedures for stationary and nonstationary autore...
One common way of describing the tasks addressable by machine learning is to break them down into th...