In every finite-state leavable gambling problem and in every finite-state Markov decision process with discounted, negative or positive reward criteria there exists a Markov strategy which is monotonically improving and optimal in the limit along every history. An example is given to show that for the positive and gambling cases such strategies cannot be constructed by simply switching to a "better" action or gamble at each successive return to a state. Key words and phrases: gambling problem, Markov decision process, strategy, stationary strategy, monotonically improving strategy, limit-optimal strategy
We are interested in the existence of pure and stationary optimal strategies in Markov decision proc...
AbstractThis paper deals with a discrete time Markov decision model with a finite state space, arbit...
Consider a Markov decision process with countable state space S and finite action space A. If in sta...
In every finite-state leavable gambling problem and in every finite-state Markov decision process wi...
In every finite-state leavable gambling problem and in every finite-state Markov decision process wi...
Suppose you are in a casino with a number of dollars you wish to gamble. You may quit whenever you p...
(Extended version of Memorandum COSOR 81-11) This paper deals with total reward Markov decision proc...
This paper gives a systematic treatment of results about the existence of various types of nearly-op...
We consider the relationship between the reward function and the existence of (nearly) optimal Marko...
We examine the use of stationary and Markov strategies in zero-sum stochastic games with finite stat...
In several standard models of dynamic programming (gambling houses, MDPs, POMDPs), we prove the exis...
International audienceIn several standard models of dynamic programming (gambling houses, MDPs, POMD...
We are interested in the existence of pure and stationary optimal strategies in Markov decision proc...
AbstractThis paper deals with a discrete time Markov decision model with a finite state space, arbit...
Consider a Markov decision process with countable state space S and finite action space A. If in sta...
In every finite-state leavable gambling problem and in every finite-state Markov decision process wi...
In every finite-state leavable gambling problem and in every finite-state Markov decision process wi...
Suppose you are in a casino with a number of dollars you wish to gamble. You may quit whenever you p...
(Extended version of Memorandum COSOR 81-11) This paper deals with total reward Markov decision proc...
This paper gives a systematic treatment of results about the existence of various types of nearly-op...
We consider the relationship between the reward function and the existence of (nearly) optimal Marko...
We examine the use of stationary and Markov strategies in zero-sum stochastic games with finite stat...
In several standard models of dynamic programming (gambling houses, MDPs, POMDPs), we prove the exis...
International audienceIn several standard models of dynamic programming (gambling houses, MDPs, POMD...
We are interested in the existence of pure and stationary optimal strategies in Markov decision proc...
AbstractThis paper deals with a discrete time Markov decision model with a finite state space, arbit...
Consider a Markov decision process with countable state space S and finite action space A. If in sta...