We consider the problem of designing policies for Markov decision processes (MDPs) with dynamic coherent risk objectives and constraints. We begin by formulating the problem in a Lagrangian framework. Under the assumption that the risk objectives and constraints can be represented by a Markov risk transition mapping, we propose an optimization-based method to synthesize Markovian policies that lower-bound the constrained risk-averse problem. We demonstrate that the formulated optimization problems are in the form of difference convex programs (DCPs) and can be solved by the disciplined convex-concave programming (DCCP) framework. We show that these results generalize linear programs for constrained MDPs with total discounted expected costs ...
In this work, we study discrete-time Markov decision processes (MDPs) under constraints with Borel s...
We study the synthesis of robust optimal control policies for Markov decision processes with transit...
We consider the problem of producing lower bounds on the optimal cost-to-go function of a Markov dec...
We consider the problem of designing policies for Markov decision processes (MDPs) with dynamic cohe...
We consider the problem of designing policies for partially observable Markov decision processes (PO...
We consider the problem of risk-sensitive motion planning in the presence of randomly moving obstacl...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Sloan School of Management, Operations Resea...
Markov decision processes (MDPs) are the defacto framework for sequential decision making in the pre...
In many sequential decision-making problems we may want to manage risk by minimizing some measure of...
While Markov Decision Processes (MDPs) have been shown to be effective models for planning under unc...
While Markov Decision Processes (MDPs) have been shown to be effective models for planning under unc...
We develop a framework for convexifying a fairly general class of optimization problems. Under addit...
We develop a framework for convexifying a fairly general class of optimization problems. Under addit...
In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are thos...
While Markov Decision Processes (MDPs) have been shown to be effective models for planning under unc...
In this work, we study discrete-time Markov decision processes (MDPs) under constraints with Borel s...
We study the synthesis of robust optimal control policies for Markov decision processes with transit...
We consider the problem of producing lower bounds on the optimal cost-to-go function of a Markov dec...
We consider the problem of designing policies for Markov decision processes (MDPs) with dynamic cohe...
We consider the problem of designing policies for partially observable Markov decision processes (PO...
We consider the problem of risk-sensitive motion planning in the presence of randomly moving obstacl...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Sloan School of Management, Operations Resea...
Markov decision processes (MDPs) are the defacto framework for sequential decision making in the pre...
In many sequential decision-making problems we may want to manage risk by minimizing some measure of...
While Markov Decision Processes (MDPs) have been shown to be effective models for planning under unc...
While Markov Decision Processes (MDPs) have been shown to be effective models for planning under unc...
We develop a framework for convexifying a fairly general class of optimization problems. Under addit...
We develop a framework for convexifying a fairly general class of optimization problems. Under addit...
In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are thos...
While Markov Decision Processes (MDPs) have been shown to be effective models for planning under unc...
In this work, we study discrete-time Markov decision processes (MDPs) under constraints with Borel s...
We study the synthesis of robust optimal control policies for Markov decision processes with transit...
We consider the problem of producing lower bounds on the optimal cost-to-go function of a Markov dec...