This paper addresses the problem of approximating the set of all solutions for Multi-objective Markov Decision Processes. We show that in the vast majority of interesting cases, the number of solutions is exponential or even infinite. In order to overcome this difficulty we propose to approximate the set of all solutions by means of a limited precision approach based on White’s multi-objective value-iteration dynamic programming algorithm. We prove that the number of calculated solutions is tractable and show experimentally that the solutions obtained are a good approximation of the true Pareto front.Funding for open access charge: Universidad de Málaga / CBUA. Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nat...
We consider stochastic planning problems that involve mul-tiple objectives such as minimizing task c...
We study and provide efficient algorithms for multi-objective model checking problems for Markov Dec...
We study and provide efficient algorithms for multi-objective model checkingproblems for Markov Deci...
Abstract — This paper describes a novel algorithm called CON-MODP for computing Pareto optimal polic...
This paper describes a novel algorithm called CONMODP for computing Pareto optimal policies for det...
This work describes MPQ-learning, an temporal-difference method that approximates the set of all non...
Problems involving optimal sequential making in uncertain dynamic systems arise in domains such as e...
We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives...
This paper is about learning a continuous approximation of the Pareto frontier in Multi-Objective Ma...
We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives...
Planning under uncertainty is a central problem in developing intelligent autonomous systems. The tr...
Iteratively solving a set of linear programs (LPs) is a common strategy for solving various decision...
We study Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) functions. We...
This paper is about learning a continuous approximation of the Pareto frontier in Multi-Objective Ma...
We consider Markov decision processes (MDPs) with multiple limit-average (ormean-payoff) objectives....
We consider stochastic planning problems that involve mul-tiple objectives such as minimizing task c...
We study and provide efficient algorithms for multi-objective model checking problems for Markov Dec...
We study and provide efficient algorithms for multi-objective model checkingproblems for Markov Deci...
Abstract — This paper describes a novel algorithm called CON-MODP for computing Pareto optimal polic...
This paper describes a novel algorithm called CONMODP for computing Pareto optimal policies for det...
This work describes MPQ-learning, an temporal-difference method that approximates the set of all non...
Problems involving optimal sequential making in uncertain dynamic systems arise in domains such as e...
We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives...
This paper is about learning a continuous approximation of the Pareto frontier in Multi-Objective Ma...
We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives...
Planning under uncertainty is a central problem in developing intelligent autonomous systems. The tr...
Iteratively solving a set of linear programs (LPs) is a common strategy for solving various decision...
We study Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) functions. We...
This paper is about learning a continuous approximation of the Pareto frontier in Multi-Objective Ma...
We consider Markov decision processes (MDPs) with multiple limit-average (ormean-payoff) objectives....
We consider stochastic planning problems that involve mul-tiple objectives such as minimizing task c...
We study and provide efficient algorithms for multi-objective model checking problems for Markov Dec...
We study and provide efficient algorithms for multi-objective model checkingproblems for Markov Deci...