Consider a countable state controlled Markov chain whose transition probability is specified up to an unknown parameter alpha taking values in a compact metric space A. To each alpha is associated a prespecified stationary control law zeta ( alpha ). The adaptive control law selects at each time t the control action zeta ( alpha sub(t), x sub(t)) where x sub(t) is the state and alpha sub(t) is the maximum likelihood estimate of alpha . The asymptotic behavior of this control scheme is investigated for the cases when the true parameter value alpha sub(0) does or does not belong to A, and for the case when zeta is chosen to minimize an average cost criterion. The analysis uses an appropriate extension of the notions of recurrence to nonstatio...
We introduce average cost optimal adaptive policies in a class of discrete-time Markov control proce...
A discrete-time Markov chain on the interval [0, 1] with two possible transitions (left or right) at...
A control problem for a partially observable Markov chain depending on a parameter with long run ave...
Consider a countable state controlled Markov chain whose transition probability is specified up to a...
Consider a controlled Markov chain whose transition probabilities depend upon an unknown parameter a...
Abstract. This paper considers Bayesian parameter estimation and an associated adaptive control sche...
We consider an adaptive finite state controlled Markov chain with partial state information, motivat...
We consider Markov decision processes where the state at time n+1 is a function of the state at time...
100 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1985.In this thesis we consider th...
This paper considers Bayesian parameter estimation and an associated adaptive control scheme for con...
Milito and Cruz have introduced a novel adaptive control scheme for finite Markov chains when a fini...
Milito and Cruz have introduced a novel adaptive control scheme for finite Markov chains when a fini...
AbstractThis paper is concerned with the adaptive control problem, over the infinite horizon, for pa...
summary:We study the adaptive control problem for discrete-time Markov control processes with Borel ...
We consider the problem of sequential control for a finite state and action Markovian Decision Proce...
We introduce average cost optimal adaptive policies in a class of discrete-time Markov control proce...
A discrete-time Markov chain on the interval [0, 1] with two possible transitions (left or right) at...
A control problem for a partially observable Markov chain depending on a parameter with long run ave...
Consider a countable state controlled Markov chain whose transition probability is specified up to a...
Consider a controlled Markov chain whose transition probabilities depend upon an unknown parameter a...
Abstract. This paper considers Bayesian parameter estimation and an associated adaptive control sche...
We consider an adaptive finite state controlled Markov chain with partial state information, motivat...
We consider Markov decision processes where the state at time n+1 is a function of the state at time...
100 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1985.In this thesis we consider th...
This paper considers Bayesian parameter estimation and an associated adaptive control scheme for con...
Milito and Cruz have introduced a novel adaptive control scheme for finite Markov chains when a fini...
Milito and Cruz have introduced a novel adaptive control scheme for finite Markov chains when a fini...
AbstractThis paper is concerned with the adaptive control problem, over the infinite horizon, for pa...
summary:We study the adaptive control problem for discrete-time Markov control processes with Borel ...
We consider the problem of sequential control for a finite state and action Markovian Decision Proce...
We introduce average cost optimal adaptive policies in a class of discrete-time Markov control proce...
A discrete-time Markov chain on the interval [0, 1] with two possible transitions (left or right) at...
A control problem for a partially observable Markov chain depending on a parameter with long run ave...