We provide a novel, flexible, iterative refinement algorithm to automatically construct an approximate statespace representation for Markov Decision Processes (MDPs). Our approach leverages bisimulation metrics, which have been used in prior work to generate features to represent the state space of MDPs. We address a drawback of this approach, which is the expensive computation of the bisimulation metrics. We propose an algorithm to generate an iteratively improving sequence of state space partitions. Partial metric computations guide the representation search and provide much lower space and computational complexity, while maintaining strong convergence properties. We provide theoretical results guaranteeing convergence as well as experime...
Bisimulation is a notion of behavioural equiva-lence on the states of a transition system. Its defi-...
This dissertation addresses the problem of sequential decision making under uncertainty in large sys...
International audienceBisimulation is a notion of behavioural equivalence on the statesof a transiti...
We provide a novel, flexible, iterative refinement algorithm to automatically construct an approxima...
We present new algorithms for computing and approximating bisimulation metrics in Markov Decision Pr...
Solution methods for MDPs employing approximation allow for more acceptable computation time in dom...
This paper provides new techniques for abstracting the state space of a Markov Decision Process (MD...
Address email We present an approximation scheme for solving Markov Decision Processes (MDPs) in whi...
State abstraction and value function approximation are essential tools for the feasibility of sequen...
We define a metric for measuring behavior similarity between states in a Markov decision process (MD...
International audienceMarkov Decision Processes (MDPs) are employed to model sequential decision-mak...
Probabilistic bisimulation is a widely studied equivalence relation for stochastic systems. However,...
AbstractMany stochastic planning problems can be represented using Markov Decision Processes (MDPs)....
We present a class of metrics, defined on the state space of a finite Markov decision process (MDP)...
Markov decision process (MDP), originally studied in the Operations Research (OR) community, provide...
Bisimulation is a notion of behavioural equiva-lence on the states of a transition system. Its defi-...
This dissertation addresses the problem of sequential decision making under uncertainty in large sys...
International audienceBisimulation is a notion of behavioural equivalence on the statesof a transiti...
We provide a novel, flexible, iterative refinement algorithm to automatically construct an approxima...
We present new algorithms for computing and approximating bisimulation metrics in Markov Decision Pr...
Solution methods for MDPs employing approximation allow for more acceptable computation time in dom...
This paper provides new techniques for abstracting the state space of a Markov Decision Process (MD...
Address email We present an approximation scheme for solving Markov Decision Processes (MDPs) in whi...
State abstraction and value function approximation are essential tools for the feasibility of sequen...
We define a metric for measuring behavior similarity between states in a Markov decision process (MD...
International audienceMarkov Decision Processes (MDPs) are employed to model sequential decision-mak...
Probabilistic bisimulation is a widely studied equivalence relation for stochastic systems. However,...
AbstractMany stochastic planning problems can be represented using Markov Decision Processes (MDPs)....
We present a class of metrics, defined on the state space of a finite Markov decision process (MDP)...
Markov decision process (MDP), originally studied in the Operations Research (OR) community, provide...
Bisimulation is a notion of behavioural equiva-lence on the states of a transition system. Its defi-...
This dissertation addresses the problem of sequential decision making under uncertainty in large sys...
International audienceBisimulation is a notion of behavioural equivalence on the statesof a transiti...