AbstractThe following optimality principle is established for finite undiscounted or discounted Markov decision processes: If a policy is (gain, bias, or discounted) optimal in one state, it is also optimal for all states reachable from this state using this policy. The optimality principle is used constructively to demonstrate the existence of a policy that is optimal in every state, and then to derive the coupled functional equations satisfied by the optimal return vectors. This reverses the usual sequence, where one first establishes (via policy iteration or linear programming) the solvability of the coupled functional equations, and then shows that the solution is indeed the optimal return vector and that the maximizing policy for the f...
AbstractWe consider a Markov decision process with an uncountable state space for which the vector p...
AbstractA finite-state iterative scheme introduced by White (in “Recent Developments in Markov Decis...
We consider multistage decision processes where criterion function is an expectation of minimum func...
AbstractThe following optimality principle is established for finite undiscounted or discounted Mark...
A Markov decision process (MDP) relies on the notions of state, describing the current situation of ...
summary:In this note we focus attention on identifying optimal policies and on elimination suboptima...
AbstractThis paper concerns a discrete-time Markov decision model with an infinite planning horizon....
AbstractWe consider the minimizing risk problems in discounted Markov decisions processes with count...
AbstractThis paper studies the minimizing risk problems in Markov decision processes with countable ...
This paper considers Markov decision processes (MDPs) with unbounded rates, as a function of state. ...
AbstractThis paper establishes a rather complete optimality theory for the average cost semi-Markov ...
AbstractThis note considers the conditions that have been put on the set of transition matrices of f...
AbstractMost quantities of interest in discounted and undiscounted (semi-) Markov decision processes...
We establish the existence of a solution to the optimality equation for discounted finite Markov dec...
AbstractWe consider a Markov decision process with an uncountable state space and multiple rewards. ...
AbstractWe consider a Markov decision process with an uncountable state space for which the vector p...
AbstractA finite-state iterative scheme introduced by White (in “Recent Developments in Markov Decis...
We consider multistage decision processes where criterion function is an expectation of minimum func...
AbstractThe following optimality principle is established for finite undiscounted or discounted Mark...
A Markov decision process (MDP) relies on the notions of state, describing the current situation of ...
summary:In this note we focus attention on identifying optimal policies and on elimination suboptima...
AbstractThis paper concerns a discrete-time Markov decision model with an infinite planning horizon....
AbstractWe consider the minimizing risk problems in discounted Markov decisions processes with count...
AbstractThis paper studies the minimizing risk problems in Markov decision processes with countable ...
This paper considers Markov decision processes (MDPs) with unbounded rates, as a function of state. ...
AbstractThis paper establishes a rather complete optimality theory for the average cost semi-Markov ...
AbstractThis note considers the conditions that have been put on the set of transition matrices of f...
AbstractMost quantities of interest in discounted and undiscounted (semi-) Markov decision processes...
We establish the existence of a solution to the optimality equation for discounted finite Markov dec...
AbstractWe consider a Markov decision process with an uncountable state space and multiple rewards. ...
AbstractWe consider a Markov decision process with an uncountable state space for which the vector p...
AbstractA finite-state iterative scheme introduced by White (in “Recent Developments in Markov Decis...
We consider multistage decision processes where criterion function is an expectation of minimum func...