In constrained Markov decision problems, optimal policies are often found to depend on quantities which are not readily available due either to insufficient knowledge of the model parameters or to computational difficulties. Thia motivates the on-line estimation (or computation) problem investigated in this paper in the context of a single parameter family of finite-state Markov chains. The computation is implemented through an algorithm of the Stochastic Approximations type which recursively generates on-line estimates for the unknown value. A useful methodology is outlined for investigating the strong consistency of the algorithm and the proof is carried out under a set of simplifying assumptions in order to illustrate the key ideas unenc...
International audienceMarkov chain modeling often suffers from the curse of dimensionality problems ...
International audienceWe propose several algorithms to obtain bounds based on Censored Markov Chains...
Another approach to finite differences is the well developed Markov Chain Approximation (MCA) of Kus...
This paper develops an a.s. convergence theory for a class of projected Stochastic Approximations dr...
AbstractThis paper develops an a.s. convergence theory for a class of projected stochastic approxima...
Due to copyright restrictions, the access to the full text of this article is only available via sub...
The paper develops a procedure for finding a discrete-valued Markov chain whose sample paths approxi...
The thesis develops methods to solve discrete-time finite-state partially observable Markov decision...
For a countable-state Markov decision process we introduce an embedding which produces a finite-stat...
Due to copyright restrictions, the access to the full text of this article is only available via sub...
We study the convergence properties of the projected stochastic approximation (SA) algo-rithm used t...
The goal of this work is to formally abstract a Markov process evolving in discrete time over a gene...
The aim of this paper is to approximate a Finite-State Markov (FSM) process by another process defin...
The (optimal) design of many engineering systems can be adequately recast as a Markov decision proce...
A class of Markov decision processes is considered with a finite state and action space and with an ...
International audienceMarkov chain modeling often suffers from the curse of dimensionality problems ...
International audienceWe propose several algorithms to obtain bounds based on Censored Markov Chains...
Another approach to finite differences is the well developed Markov Chain Approximation (MCA) of Kus...
This paper develops an a.s. convergence theory for a class of projected Stochastic Approximations dr...
AbstractThis paper develops an a.s. convergence theory for a class of projected stochastic approxima...
Due to copyright restrictions, the access to the full text of this article is only available via sub...
The paper develops a procedure for finding a discrete-valued Markov chain whose sample paths approxi...
The thesis develops methods to solve discrete-time finite-state partially observable Markov decision...
For a countable-state Markov decision process we introduce an embedding which produces a finite-stat...
Due to copyright restrictions, the access to the full text of this article is only available via sub...
We study the convergence properties of the projected stochastic approximation (SA) algo-rithm used t...
The goal of this work is to formally abstract a Markov process evolving in discrete time over a gene...
The aim of this paper is to approximate a Finite-State Markov (FSM) process by another process defin...
The (optimal) design of many engineering systems can be adequately recast as a Markov decision proce...
A class of Markov decision processes is considered with a finite state and action space and with an ...
International audienceMarkov chain modeling often suffers from the curse of dimensionality problems ...
International audienceWe propose several algorithms to obtain bounds based on Censored Markov Chains...
Another approach to finite differences is the well developed Markov Chain Approximation (MCA) of Kus...