Abstract. This paper considers Bayesian parameter estimation and an associated adaptive control scheme for controlled Markov chains and diffu-sions with time-averaged cost. Asymptotic behaviour of the posterior law of the parameter given the observed trajectory is analyzed. This analysis sug-gests a “cost-biased ” estimation scheme and associated self-tuning adaptive control. This is shown to be asymptotically optimal in the almost sure sense. I. Introduction. A popular scheme for adaptive control is the so-called “self-tuning ” control wherein a parameterized family of system models is presupposed and the parameter is estimated “on-line”. One then uses at each time instant that control which would have been the optimal choice for the curre...
The self-tuning method of adaptive control for diffusions consists of estimating the unknown paramet...
Abstract. Three distinct controlled ergodic Markov models are considered here. The models are a disc...
This work addresses the problem of estimating the optimal value function in a Markov Decision Proces...
This paper considers Bayesian parameter estimation and an associated adaptive control scheme for con...
Consider a countable state controlled Markov chain whose transition probability is specified up to a...
100 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1985.In this thesis we consider th...
We consider Markov decision processes where the state at time n+1 is a function of the state at time...
This thesis analyzes the Bayesian control law for adaptive control proposed by Ortega and Braun. Th...
We consider an adaptive finite state controlled Markov chain with partial state information, motivat...
AbstractThis paper is concerned with the adaptive control problem, over the infinite horizon, for pa...
Consider a controlled Markov chain whose transition probabilities depend upon an unknown parameter a...
We study Bayesian optimal control of a general class of smoothly parameterized Markov deci-sion prob...
This work addresses the problem of estimating the optimal value function in a MarkovDecision Process...
The self-tuning method of adaptive control for diffusions consists of estimating the unknown paramet...
We consider the problem of sequential control for a finite state and action Markovian Decision Proce...
The self-tuning method of adaptive control for diffusions consists of estimating the unknown paramet...
Abstract. Three distinct controlled ergodic Markov models are considered here. The models are a disc...
This work addresses the problem of estimating the optimal value function in a Markov Decision Proces...
This paper considers Bayesian parameter estimation and an associated adaptive control scheme for con...
Consider a countable state controlled Markov chain whose transition probability is specified up to a...
100 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1985.In this thesis we consider th...
We consider Markov decision processes where the state at time n+1 is a function of the state at time...
This thesis analyzes the Bayesian control law for adaptive control proposed by Ortega and Braun. Th...
We consider an adaptive finite state controlled Markov chain with partial state information, motivat...
AbstractThis paper is concerned with the adaptive control problem, over the infinite horizon, for pa...
Consider a controlled Markov chain whose transition probabilities depend upon an unknown parameter a...
We study Bayesian optimal control of a general class of smoothly parameterized Markov deci-sion prob...
This work addresses the problem of estimating the optimal value function in a MarkovDecision Process...
The self-tuning method of adaptive control for diffusions consists of estimating the unknown paramet...
We consider the problem of sequential control for a finite state and action Markovian Decision Proce...
The self-tuning method of adaptive control for diffusions consists of estimating the unknown paramet...
Abstract. Three distinct controlled ergodic Markov models are considered here. The models are a disc...
This work addresses the problem of estimating the optimal value function in a Markov Decision Proces...