The paper studies optimization of average-reward continuous-time finite state and action Markov Deci-sion Processes with multiple criteria and constraints. Under the standard unichain assumption, we prove the existence of optimal K-switching strategies for feasi-ble problems with K constraints. For switching ran-domized strategies, the decisions depend on the current state and the the time spent in the current state after the last jump. For stationary strategies, these functions do not depend on sojourn times, i.e. they are constant in time. For K-snitching strategies, these functions are piece-wise constant and the total number of jumps is limited by K. If there is no absorbing states, there exist also optimal K-randomized policies. We con...
summary:In this paper, we study continuous time Markov decision processes (CTMDPs) with a denumerabl...
Continuous-time Markov decision processes (CTMDPs) are widely used for the control of queueing syste...
We consider mean-field control problems in discrete time with discounted reward, infinite time horiz...
AbstractThis paper deals with the average expected reward criterion for continuous-time Markov decis...
AbstractThis paper deals with a discrete time Markov decision model with a finite state space, arbit...
AbstractThe time average reward for a discrete-time controlled Markov process subject to a time-aver...
We consider multistage decision processes where criterion function is an expectation of minimum func...
We study the policy iteration algorithm (PIA) for continuous-time jump Markov decision processes in ...
Time-average Markov decision problems are considered for the finite state and action spaces. Several...
Consider a Markov decision process with countable state space S and finite action space A. If in sta...
We consider Howard's policy iteration algorithm for multichained finite state and action Markov deci...
The paper deals with continuous time Markov decision processes on a fairly general state space. The ...
In this article, we study continuous-time Markov decision processes in Polish spaces. The optimality...
The first part considers discrete-time constrained Markov Decision Processes (MDPs). At each epoch, ...
We consider a discrete time Markov Decision Process with infinite horizon. The criterion to be maxim...
summary:In this paper, we study continuous time Markov decision processes (CTMDPs) with a denumerabl...
Continuous-time Markov decision processes (CTMDPs) are widely used for the control of queueing syste...
We consider mean-field control problems in discrete time with discounted reward, infinite time horiz...
AbstractThis paper deals with the average expected reward criterion for continuous-time Markov decis...
AbstractThis paper deals with a discrete time Markov decision model with a finite state space, arbit...
AbstractThe time average reward for a discrete-time controlled Markov process subject to a time-aver...
We consider multistage decision processes where criterion function is an expectation of minimum func...
We study the policy iteration algorithm (PIA) for continuous-time jump Markov decision processes in ...
Time-average Markov decision problems are considered for the finite state and action spaces. Several...
Consider a Markov decision process with countable state space S and finite action space A. If in sta...
We consider Howard's policy iteration algorithm for multichained finite state and action Markov deci...
The paper deals with continuous time Markov decision processes on a fairly general state space. The ...
In this article, we study continuous-time Markov decision processes in Polish spaces. The optimality...
The first part considers discrete-time constrained Markov Decision Processes (MDPs). At each epoch, ...
We consider a discrete time Markov Decision Process with infinite horizon. The criterion to be maxim...
summary:In this paper, we study continuous time Markov decision processes (CTMDPs) with a denumerabl...
Continuous-time Markov decision processes (CTMDPs) are widely used for the control of queueing syste...
We consider mean-field control problems in discrete time with discounted reward, infinite time horiz...