The parameters for a Markov Decision Process (MDP) often cannot be specified exactly. Uncertain MDPs (UMDPs) capture this model ambiguity by defining sets which the parameters belong to. Minimax regret has been proposed as an objective for planning in UMDPs to find robust policies which are not overly conservative. In this work, we focus on planning for Stochastic Shortest Path (SSP) UMDPs with uncertain cost and transition functions. We introduce a Bellman equation to compute the regret for a policy. We propose a dynamic programming algorithm that utilises the regret Bellman equation, and show that it optimises minimax regret exactly for UMDPs with independent uncertainties. For coupled uncertainties, we extend our approach to use options ...
Uncertainty in optimization is not a new ingredient. Diverse models considering uncertainty have bee...
We consider large-scale Markov decision pro-cesses (MDPs) with parameter uncertainty, un-der the rob...
Candia-Vejar, A (reprint author), Univ Talca, Modeling & Ind Management Dept, Curico, Chile.Uncertai...
The parameters for a Markov Decision Process (MDP) often cannot be specified exactly. Uncertain MDPs...
The parameters for a Markov Decision Process (MDP) often cannot be specified exactly. Uncertain MDPs...
In this paper, we seek robust policies for uncertain Markov Decision Processes (MDPs). Most robust o...
© 2017 AI Access Foundation. All rights reserved. Markov Decision Processes (MDPs) are an effective ...
The precise specification of reward functions for Markov decision processes (MDPs) is often extremel...
Abstract — We consider decision making in a Markovian setup where the reward parameters are not know...
Markov decision processes (MDP) is a standard modeling tool for sequential decision making in a dyna...
Markov Decision Problems, MDPs offer an effective mechanism for planning under uncertainty. However,...
Markov Decision Problems, MDPs offer an effective mechanism for planning under uncertainty. However,...
Markov Decision Problems, MDPs offer an effective mech-anism for planning under uncertainty. However...
National Research Foundation (NRF) Singapore under Singapore-MIT Alliance for Research and Technolog...
In this chapter a class of scheduling problems with uncertain parameters is dis-cussed. The uncertai...
Uncertainty in optimization is not a new ingredient. Diverse models considering uncertainty have bee...
We consider large-scale Markov decision pro-cesses (MDPs) with parameter uncertainty, un-der the rob...
Candia-Vejar, A (reprint author), Univ Talca, Modeling & Ind Management Dept, Curico, Chile.Uncertai...
The parameters for a Markov Decision Process (MDP) often cannot be specified exactly. Uncertain MDPs...
The parameters for a Markov Decision Process (MDP) often cannot be specified exactly. Uncertain MDPs...
In this paper, we seek robust policies for uncertain Markov Decision Processes (MDPs). Most robust o...
© 2017 AI Access Foundation. All rights reserved. Markov Decision Processes (MDPs) are an effective ...
The precise specification of reward functions for Markov decision processes (MDPs) is often extremel...
Abstract — We consider decision making in a Markovian setup where the reward parameters are not know...
Markov decision processes (MDP) is a standard modeling tool for sequential decision making in a dyna...
Markov Decision Problems, MDPs offer an effective mechanism for planning under uncertainty. However,...
Markov Decision Problems, MDPs offer an effective mechanism for planning under uncertainty. However,...
Markov Decision Problems, MDPs offer an effective mech-anism for planning under uncertainty. However...
National Research Foundation (NRF) Singapore under Singapore-MIT Alliance for Research and Technolog...
In this chapter a class of scheduling problems with uncertain parameters is dis-cussed. The uncertai...
Uncertainty in optimization is not a new ingredient. Diverse models considering uncertainty have bee...
We consider large-scale Markov decision pro-cesses (MDPs) with parameter uncertainty, un-der the rob...
Candia-Vejar, A (reprint author), Univ Talca, Modeling & Ind Management Dept, Curico, Chile.Uncertai...