We consider a semi-Markov decision process with arbitrary action space; the state space is the nonnegative integers. As in queueing systems, we assume that {0, 1, 2, ..., n + N} is the set of states accessible from state n in one transition, where N is finite and independent of n. The novel feature of this model is that the one-period reward is not required to be uniformly bounded; instead, we merely assume it to be bounded by a polynomial in n. Our main concern is with the average cost problem. A set of conditions sufficient for there to be an optimal stationary policy which can be obtained from the usual functional equation is developed. These conditions are quite weak and, as illustrated in several queueing examples, are easily verified.
AbstractThis paper deals with a discrete time Markov decision model with a finite state space, arbit...
We consider partially observable Markov decision processes with finite or countably infinite (core) ...
AbstractFor a vector-valued Markov decision process with discounted reward criterion, we introduce a...
Considered are semi-Markov decision processes (SMDPs) with finite state and action spaces. We study ...
AbstractThis paper establishes a rather complete optimality theory for the average cost semi-Markov ...
We shall be concerned with the optimization problem of semi-Markov decision processes with countable...
The first part considers discrete-time constrained Markov Decision Processes (MDPs). At each epoch, ...
Consider a Markov decision process with countable state space S and finite action space A. If in sta...
AbstractWe consider a Markov decision process with an uncountable state space for which the vector p...
Abstract. Bounded parameter Markov Decision Processes (BMDPs) address the issue of dealing with unce...
AbstractThis paper deals with the average expected reward criterion for continuous-time Markov decis...
This work considers denumerable state Markov Decision Chains endowed with a long-run expected averag...
International audienceWe consider a discrete-time Markov decision process with Borel state and actio...
summary:In this paper we give a new set of verifiable conditions for the existence of average optima...
We consider partially observable Markov decision processes with finite or count-ably infinite (core)...
AbstractThis paper deals with a discrete time Markov decision model with a finite state space, arbit...
We consider partially observable Markov decision processes with finite or countably infinite (core) ...
AbstractFor a vector-valued Markov decision process with discounted reward criterion, we introduce a...
Considered are semi-Markov decision processes (SMDPs) with finite state and action spaces. We study ...
AbstractThis paper establishes a rather complete optimality theory for the average cost semi-Markov ...
We shall be concerned with the optimization problem of semi-Markov decision processes with countable...
The first part considers discrete-time constrained Markov Decision Processes (MDPs). At each epoch, ...
Consider a Markov decision process with countable state space S and finite action space A. If in sta...
AbstractWe consider a Markov decision process with an uncountable state space for which the vector p...
Abstract. Bounded parameter Markov Decision Processes (BMDPs) address the issue of dealing with unce...
AbstractThis paper deals with the average expected reward criterion for continuous-time Markov decis...
This work considers denumerable state Markov Decision Chains endowed with a long-run expected averag...
International audienceWe consider a discrete-time Markov decision process with Borel state and actio...
summary:In this paper we give a new set of verifiable conditions for the existence of average optima...
We consider partially observable Markov decision processes with finite or count-ably infinite (core)...
AbstractThis paper deals with a discrete time Markov decision model with a finite state space, arbit...
We consider partially observable Markov decision processes with finite or countably infinite (core) ...
AbstractFor a vector-valued Markov decision process with discounted reward criterion, we introduce a...