Value-function approximation is investigated for the solution via Dynamic Programming (DP) of continuous-state sequential N-stage decision problems, in which the reward to be maximized has an additive structure over a finite number of stages. Conditions that guarantee smoothness properties of the value function at each stage are derived. These properties are exploited to approximate such functions by means of certain nonlinear approximation schemes, which include splines of suitable order and Gaussian radial-basis networks with variable centers and widths. The accuracies of suboptimal solutions obtained by combining DP with these approximation tools are estimated. The results provide insights into the successful performances appeared in the...
Abstract Suboptimal solutions to infinite-horizon dynamic optimization problems with continuous stat...
Many sequential decision problems can be formulated as Markov decision processes (MDPs) where the op...
This paper studies fitted value iteration for continuous state dynamic programming using nonexpansiv...
Sequential decision problems are considered, where a reward additive over a number of stages has to ...
Sequential decision problems are considered, where a reward additive over a number of stages has to ...
Stochastic optimization problems with an objective function that is additive over a finite number of...
International audienceIn any complex or large scale sequential decision making problem, there is a c...
This paper studies fitted value iteration for continuous state numerical dynamic programming using n...
We compare alternative numerical methods for approximating solutions to continuous-state dynamic pro...
ABSTRACT. This paper studies fitted value iteration for contin-uous state numerical dynamic programm...
Connections between function approximation and classes of functional optimization problems, whose ad...
The approximation of the optimal policy functions is investigated for dynamic optimization problems ...
AbstractA sequential decision model is developed in the context of which three principles of optimal...
ISSN 0819-2642 ISBN 0 7340 2618 8 Research Paper Number 961This paper studies fitted value iteration...
Abstract Suboptimal solutions to infinite-horizon dynamic optimization problems with continuous stat...
Abstract Suboptimal solutions to infinite-horizon dynamic optimization problems with continuous stat...
Many sequential decision problems can be formulated as Markov decision processes (MDPs) where the op...
This paper studies fitted value iteration for continuous state dynamic programming using nonexpansiv...
Sequential decision problems are considered, where a reward additive over a number of stages has to ...
Sequential decision problems are considered, where a reward additive over a number of stages has to ...
Stochastic optimization problems with an objective function that is additive over a finite number of...
International audienceIn any complex or large scale sequential decision making problem, there is a c...
This paper studies fitted value iteration for continuous state numerical dynamic programming using n...
We compare alternative numerical methods for approximating solutions to continuous-state dynamic pro...
ABSTRACT. This paper studies fitted value iteration for contin-uous state numerical dynamic programm...
Connections between function approximation and classes of functional optimization problems, whose ad...
The approximation of the optimal policy functions is investigated for dynamic optimization problems ...
AbstractA sequential decision model is developed in the context of which three principles of optimal...
ISSN 0819-2642 ISBN 0 7340 2618 8 Research Paper Number 961This paper studies fitted value iteration...
Abstract Suboptimal solutions to infinite-horizon dynamic optimization problems with continuous stat...
Abstract Suboptimal solutions to infinite-horizon dynamic optimization problems with continuous stat...
Many sequential decision problems can be formulated as Markov decision processes (MDPs) where the op...
This paper studies fitted value iteration for continuous state dynamic programming using nonexpansiv...