peer reviewedThis technical note proposes a new framework for the design of continuous time, finite state space Markov processes. In particular, we propose a paradigm for selecting an optimal matrix within a pre-specified pencil of transition rate matrices. Given any transition rate matrix specifying the time-evolution of the Markov process, we propose a class of figures of merit that upper-bounds the long-term evolution of any statistical moment. We show that optimization with respect to the aforementioned class of cost functions is tractable via dualization and linear programming methods. In addition, we suggest how this approach can be used as a tool for the sub-optimal design of the master equation, with performance guarantees. Our resu...
We study the convergence of Markov Decision Processes made of a large number of objects to optimizat...
We study the problem of computing the optimal value function for a Markov decision process with posi...
We derive a new expectation maximization algorithm for policy optimization in linear Gaussian Markov...
This technical note proposes a new framework for the design of continuous time, finite state space M...
Abstract. State-based systems with discrete or continuous time are of-ten modelled with the help of ...
Optimal control provides an appealing machinery to complete complicated control tasks with limited p...
The goal of this paper is two-fold: First, we present a sensitivity point of view on the optimizatio...
Construction of stochastic models that describe the effective dynamics of observables of interest is...
Problems of sequential decisions are marked by the fact that the consequences of a decision made at ...
CSO2009, Hainan, IEEE Computer Society Proceedings, (2009) 551-555.We study the problem of construct...
We introduce a new algorithm based on linear programming for optimization of average-cost Markov dec...
This thesis is devoted to the extension of the recently developed direct comparison approach from th...
Abstract. The goal of this paper is two-fold: First, we present a sensitivity point of view on the o...
Abstract—We study the convergence of Markov decision pro-cesses, composed of a large number of objec...
We derive a new expectation maximization algorithm for policy optimization in linear Gaussian Markov...
We study the convergence of Markov Decision Processes made of a large number of objects to optimizat...
We study the problem of computing the optimal value function for a Markov decision process with posi...
We derive a new expectation maximization algorithm for policy optimization in linear Gaussian Markov...
This technical note proposes a new framework for the design of continuous time, finite state space M...
Abstract. State-based systems with discrete or continuous time are of-ten modelled with the help of ...
Optimal control provides an appealing machinery to complete complicated control tasks with limited p...
The goal of this paper is two-fold: First, we present a sensitivity point of view on the optimizatio...
Construction of stochastic models that describe the effective dynamics of observables of interest is...
Problems of sequential decisions are marked by the fact that the consequences of a decision made at ...
CSO2009, Hainan, IEEE Computer Society Proceedings, (2009) 551-555.We study the problem of construct...
We introduce a new algorithm based on linear programming for optimization of average-cost Markov dec...
This thesis is devoted to the extension of the recently developed direct comparison approach from th...
Abstract. The goal of this paper is two-fold: First, we present a sensitivity point of view on the o...
Abstract—We study the convergence of Markov decision pro-cesses, composed of a large number of objec...
We derive a new expectation maximization algorithm for policy optimization in linear Gaussian Markov...
We study the convergence of Markov Decision Processes made of a large number of objects to optimizat...
We study the problem of computing the optimal value function for a Markov decision process with posi...
We derive a new expectation maximization algorithm for policy optimization in linear Gaussian Markov...