100 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1985.In this thesis we consider the control of a dynamic system modeled as a Markov chain. The transition probability matrix of the Markov chain depends on the control u and also on an unknown parameter (alpha)('o). The unknown parameter belongs to a given finite set A. The performance of the system is measured by a long run average cost criterion. A direct approach to the optimization of the performance is not feasible. A common procedure calls for an on-line estimation of the unknown parameter and the minimization of the cost functional using the estimate in lieu of the true parameter. This certainty equivalence (CE) solution may fail to achieve optimal performance.We give ...
A control problem for a partially observable Markov chain depending on a parameter with long run ave...
We consider the problem of sequential control for a finite state and action Markovian Decision Proce...
The purpose of flow control is to reduce the congestion experienced in many systems, such as data ne...
Consider a controlled Markov chain whose transition probabilities depend upon an unknown parameter a...
Abstract. This paper considers Bayesian parameter estimation and an associated adaptive control sche...
AbstractThis paper is concerned with the adaptive control problem, over the infinite horizon, for pa...
We consider Markov decision processes where the state at time n+1 is a function of the state at time...
Consider a countable state controlled Markov chain whose transition probability is specified up to a...
We consider an adaptive finite state controlled Markov chain with partial state information, motivat...
In [5], the authors showed that threshold policies solve an optimal flow control problem for discret...
Abstract. Three distinct controlled ergodic Markov models are considered here. The models are a disc...
A partially observed stochastic system is described by a discrete time pair of Markov processes. The...
This is the published version, also available here: http://dx.doi.org/10.1137/S0363012996298369.Thre...
We study the problem of long-run average cost control of Markov chains conditioned on a rare event. ...
Note:Iterative algorithms are proposed for adaptive long-run average cost control of finite state Ma...
A control problem for a partially observable Markov chain depending on a parameter with long run ave...
We consider the problem of sequential control for a finite state and action Markovian Decision Proce...
The purpose of flow control is to reduce the congestion experienced in many systems, such as data ne...
Consider a controlled Markov chain whose transition probabilities depend upon an unknown parameter a...
Abstract. This paper considers Bayesian parameter estimation and an associated adaptive control sche...
AbstractThis paper is concerned with the adaptive control problem, over the infinite horizon, for pa...
We consider Markov decision processes where the state at time n+1 is a function of the state at time...
Consider a countable state controlled Markov chain whose transition probability is specified up to a...
We consider an adaptive finite state controlled Markov chain with partial state information, motivat...
In [5], the authors showed that threshold policies solve an optimal flow control problem for discret...
Abstract. Three distinct controlled ergodic Markov models are considered here. The models are a disc...
A partially observed stochastic system is described by a discrete time pair of Markov processes. The...
This is the published version, also available here: http://dx.doi.org/10.1137/S0363012996298369.Thre...
We study the problem of long-run average cost control of Markov chains conditioned on a rare event. ...
Note:Iterative algorithms are proposed for adaptive long-run average cost control of finite state Ma...
A control problem for a partially observable Markov chain depending on a parameter with long run ave...
We consider the problem of sequential control for a finite state and action Markovian Decision Proce...
The purpose of flow control is to reduce the congestion experienced in many systems, such as data ne...