We consider partially observable Markov decision processes (POMDPs) with limit-average payoff, where a reward value in the interval [0,1] is associated with every transition, and the payoff of an infinite path is the long-run average of the rewards. We consider two types of path constraints: (i) a quantitative constraint defines the set of paths where the payoff is at least a given threshold λ1ε(0,1]; and (ii) a qualitative constraint which is a special case of the quantitative constraint with λ1=1. We consider the computation of the almost-sure winning set, where the controller needs to ensure that the path constraint is satisfied with probability 1. Our main results for qualitative path constraints are as follows: (i) the problem of decid...
A standard objective in partially-observable Markov decision processes (POMDPs) is to find a policy ...
Partially observable Markov decision processes (POMDPs) are standard models for dynamic systems with...
We consider partially observable Markov decision processes (POMDPs) with omega-regular conditions sp...
We study partially observable Markov decision processes (POMDPs) with objectives used in verificatio...
We study the problem of approximation of optimal values in partially-observable Markov decision proc...
We consider partially observable Markov decision processes (POMDPs) with ω-regular conditions specif...
We consider partially observable Markov decision processes (POMDPs) with a set of target states and ...
We consider partially observable Markov decision processes (POMDPs) with ω-regular conditions specif...
We consider partially observable Markov decision processes (POMDPs) with a set of target states and ...
We consider partially observable Markov decision processes (POMDPs) with ω-regular conditions specif...
Optimal policy computation in finite-horizon Markov decision processes is a classical problem in opt...
Partially observable Markov decision processes (POMDPs) are standard models for dynamic systems with...
The Partially Observable Markov Decision Process (POMDP) framework has proven useful in planning dom...
Partially-Observable Markov Decision Processes (POMDPs) are a well-known stochastic model for sequen...
A standard objective in partially-observable Markov decision processes (POMDPs) is to find a policy ...
A standard objective in partially-observable Markov decision processes (POMDPs) is to find a policy ...
Partially observable Markov decision processes (POMDPs) are standard models for dynamic systems with...
We consider partially observable Markov decision processes (POMDPs) with omega-regular conditions sp...
We study partially observable Markov decision processes (POMDPs) with objectives used in verificatio...
We study the problem of approximation of optimal values in partially-observable Markov decision proc...
We consider partially observable Markov decision processes (POMDPs) with ω-regular conditions specif...
We consider partially observable Markov decision processes (POMDPs) with a set of target states and ...
We consider partially observable Markov decision processes (POMDPs) with ω-regular conditions specif...
We consider partially observable Markov decision processes (POMDPs) with a set of target states and ...
We consider partially observable Markov decision processes (POMDPs) with ω-regular conditions specif...
Optimal policy computation in finite-horizon Markov decision processes is a classical problem in opt...
Partially observable Markov decision processes (POMDPs) are standard models for dynamic systems with...
The Partially Observable Markov Decision Process (POMDP) framework has proven useful in planning dom...
Partially-Observable Markov Decision Processes (POMDPs) are a well-known stochastic model for sequen...
A standard objective in partially-observable Markov decision processes (POMDPs) is to find a policy ...
A standard objective in partially-observable Markov decision processes (POMDPs) is to find a policy ...
Partially observable Markov decision processes (POMDPs) are standard models for dynamic systems with...
We consider partially observable Markov decision processes (POMDPs) with omega-regular conditions sp...