International audienceWe study in this paper a multiobjective dynamic programm-ming where all the criteria are in the form of total expected sum of costs till absorption in some set of states M. We assume that instantaneous costs are strictly positive and make no assumption on the ergodic structure of the Markov Decision Process. Our main result is to extend the linear program solution approach that was previously derived for transient CMDPs (Constrained Markov Decision Processes) to general ergodic structure. Several (additive) cost met-rics are defined and (possibly randomized) routing policies are sought which minimize one of the costs subject to constraints over the other objectives
A Markov decision process (MDP) relies on the notions of state, describing the current situation of ...
AbstractWe consider a Markov decision process with an uncountable state space and multiple rewards. ...
We give mild conditions for the existence of optimal solutions for a Markov decision problem with av...
International audienceWe study in this paper a multiobjective dynamic programm-ming where all the cr...
International audienceThis paper deals with discrete-time Markov Decision Processes (MDP's) under co...
The first part considers discrete-time constrained Markov Decision Processes (MDPs). At each epoch, ...
In this paper, a mapping is developed between the ‘multichain’ and ‘unchain’ linear programs for ave...
We consider the problem of designing policies for Markov decision processes (MDPs) with dynamic cohe...
We consider a discounted Markov Decision Process (MDP) supplemented with the requirement that anothe...
International audienceWe consider a discrete-time constrained discounted Markov decision process (MD...
This note considers finite state and action spaces controlled Markov chains with multiple costs. The...
Linear Programming is known to be an important and useful tool for solving Markov Decision Processes...
We consider multistage decision processes where criterion function is an expectation of minimum func...
This letter investigates the structure of the optimal policy for a class of Markov decision processe...
In this paper we consider a constrained optimization of discrete time Markov Decision Processes (MDP...
A Markov decision process (MDP) relies on the notions of state, describing the current situation of ...
AbstractWe consider a Markov decision process with an uncountable state space and multiple rewards. ...
We give mild conditions for the existence of optimal solutions for a Markov decision problem with av...
International audienceWe study in this paper a multiobjective dynamic programm-ming where all the cr...
International audienceThis paper deals with discrete-time Markov Decision Processes (MDP's) under co...
The first part considers discrete-time constrained Markov Decision Processes (MDPs). At each epoch, ...
In this paper, a mapping is developed between the ‘multichain’ and ‘unchain’ linear programs for ave...
We consider the problem of designing policies for Markov decision processes (MDPs) with dynamic cohe...
We consider a discounted Markov Decision Process (MDP) supplemented with the requirement that anothe...
International audienceWe consider a discrete-time constrained discounted Markov decision process (MD...
This note considers finite state and action spaces controlled Markov chains with multiple costs. The...
Linear Programming is known to be an important and useful tool for solving Markov Decision Processes...
We consider multistage decision processes where criterion function is an expectation of minimum func...
This letter investigates the structure of the optimal policy for a class of Markov decision processe...
In this paper we consider a constrained optimization of discrete time Markov Decision Processes (MDP...
A Markov decision process (MDP) relies on the notions of state, describing the current situation of ...
AbstractWe consider a Markov decision process with an uncountable state space and multiple rewards. ...
We give mild conditions for the existence of optimal solutions for a Markov decision problem with av...