Bisimulation is a notion of behavioural equiva-lence on the states of a transition system. Its defi-nition has been extended to Markov decision pro-cesses, where it can be used to aggregate states. A bisimulation metric is a quantitative analog of bisimulation that measures how similar states are from a the perspective of long-term behavior. Bisimulation metrics have been used to establish approximation bounds for state aggregation and other forms of value function approximation. In this paper, we prove that a bisimulation metric defined on the state space of a Markov decision process is the optimal value function of an opti-mal coupling of two copies of the original model. We prove the result in the general case of con-tinuous state spaces...
Bisimulation metrics are used to estimate the behavioural distance between probabilistic systems. Th...
This paper defines action-labelled quantitative transition systems as a general framework for combin...
Bisimulation metrics are used to estimate the behavioural distance between probabilistic systems. Th...
International audienceBisimulation is a notion of behavioural equivalence on the statesof a transiti...
International audienceBisimulation is a notion of behavioural equivalence on the statesof a transiti...
We present new algorithms for computing and approximating bisimulation metrics in Markov Decision Pr...
Bisimulation metrics define a distance measure between states of a Markov decision process (MDP) bas...
International audienceWe transfer a notion of quantitative bisimilarity for labelled Markov processe...
International audienceWe transfer a notion of quantitative bisimilarity for labelled Markov processe...
Probabilistic bisimulation is a widely studied equivalence relation for stochastic systems. However,...
International audienceWe transfer a notion of quantitative bisimilarity for labelled Markov processe...
International audienceWe transfer a notion of quantitative bisimilarity for labelled Markov processe...
International audienceWe transfer a notion of quantitative bisimilarity for labelled Markov processe...
We present a class of metrics, defined on the state space of a finite Markov decision process (MDP)...
We define a metric for measuring behavior similarity between states in a Markov decision process (MD...
Bisimulation metrics are used to estimate the behavioural distance between probabilistic systems. Th...
This paper defines action-labelled quantitative transition systems as a general framework for combin...
Bisimulation metrics are used to estimate the behavioural distance between probabilistic systems. Th...
International audienceBisimulation is a notion of behavioural equivalence on the statesof a transiti...
International audienceBisimulation is a notion of behavioural equivalence on the statesof a transiti...
We present new algorithms for computing and approximating bisimulation metrics in Markov Decision Pr...
Bisimulation metrics define a distance measure between states of a Markov decision process (MDP) bas...
International audienceWe transfer a notion of quantitative bisimilarity for labelled Markov processe...
International audienceWe transfer a notion of quantitative bisimilarity for labelled Markov processe...
Probabilistic bisimulation is a widely studied equivalence relation for stochastic systems. However,...
International audienceWe transfer a notion of quantitative bisimilarity for labelled Markov processe...
International audienceWe transfer a notion of quantitative bisimilarity for labelled Markov processe...
International audienceWe transfer a notion of quantitative bisimilarity for labelled Markov processe...
We present a class of metrics, defined on the state space of a finite Markov decision process (MDP)...
We define a metric for measuring behavior similarity between states in a Markov decision process (MD...
Bisimulation metrics are used to estimate the behavioural distance between probabilistic systems. Th...
This paper defines action-labelled quantitative transition systems as a general framework for combin...
Bisimulation metrics are used to estimate the behavioural distance between probabilistic systems. Th...