In industrial applications, the processes of optimal sequential decision making are naturally formulated and optimized within a standard setting of Markov decision theory. In practice, however, decisions must be made under incomplete and uncertain information about parameters and transition probabilities. This situation occurs when a system may suffer a regime switch changing not only the transition probabilities but also the control costs. After such an event, the effect of the actions may turn to the opposite, meaning that all strategies must be revised. Due to practical importance of this problem, a variety of methods has been suggested, ranging from incorporating regime switches into Markov dynamics to numerous concepts addressing model...
Partially observable Markov decision processes (POMDPs) provide a natural and principled framework t...
Optimal control in non-stationary Markov decision processes (MDP) is a challenging problem. The aim ...
When modeling real-world decision-theoretic planning problems in the Markov decision process (MDP) f...
In industrial applications, the processes of optimal sequential decision making are naturally formul...
We propose a new method for learning policies for large, partially observable Markov decision proces...
We propose a new method for learning policies for large, partially observ-able Markov decision proce...
Markov Decision Processes (MDPs) constitute a mathematical framework for modelling systems featuring...
This dissertation studies the applicability of convex optimization to the formal verification and sy...
Markov decision process (MDP) models are widely used for modeling sequential decision-making problem...
Markov Decision Problems (MDPs) are the foundation for many problems that are of interest to researc...
<p>This dissertation describes sequential decision making problems in non-stationary environments. O...
When modeling real-world decision-theoretic planning problems in the Markov Decision Process (MDP) f...
Abstract — This paper presents a new robust decision-making algorithm that accounts for model uncert...
We consider the control of a Markov decision process (MDP) that undergoes an abrupt change in its tr...
© 2016 Informa UK Limited, trading as Taylor & Francis Group. In industrial applications, optimal co...
Partially observable Markov decision processes (POMDPs) provide a natural and principled framework t...
Optimal control in non-stationary Markov decision processes (MDP) is a challenging problem. The aim ...
When modeling real-world decision-theoretic planning problems in the Markov decision process (MDP) f...
In industrial applications, the processes of optimal sequential decision making are naturally formul...
We propose a new method for learning policies for large, partially observable Markov decision proces...
We propose a new method for learning policies for large, partially observ-able Markov decision proce...
Markov Decision Processes (MDPs) constitute a mathematical framework for modelling systems featuring...
This dissertation studies the applicability of convex optimization to the formal verification and sy...
Markov decision process (MDP) models are widely used for modeling sequential decision-making problem...
Markov Decision Problems (MDPs) are the foundation for many problems that are of interest to researc...
<p>This dissertation describes sequential decision making problems in non-stationary environments. O...
When modeling real-world decision-theoretic planning problems in the Markov Decision Process (MDP) f...
Abstract — This paper presents a new robust decision-making algorithm that accounts for model uncert...
We consider the control of a Markov decision process (MDP) that undergoes an abrupt change in its tr...
© 2016 Informa UK Limited, trading as Taylor & Francis Group. In industrial applications, optimal co...
Partially observable Markov decision processes (POMDPs) provide a natural and principled framework t...
Optimal control in non-stationary Markov decision processes (MDP) is a challenging problem. The aim ...
When modeling real-world decision-theoretic planning problems in the Markov decision process (MDP) f...