We develop a framework for convexifying a fairly general class of optimization problems. Under additional assumptions, we analyze the suboptimality of the solution to the convexified problem relative to the original non-convex problem and prove additive approximation guarantees. We then develop algorithms based on stochastic gradient methods to solve the resulting optimization problems and show bounds on convergence rates. We then extend this framework to apply to a general class of discrete-time dynamical systems. In this context, our convexification approach falls under the well-studied paradigm of risk-sensitive Markov Decision Processes. We derive the first known model-based and model-free policy gradient optimization algorithms wi...
We study the risk-sensitive exponential cost MDP formulation and develop a trajectory-based gradient...
In many sequential decision-making problems we may want to manage risk by minimizing some measure of...
We analyze the global and local behavior of gradient-like flows under stochastic errors towards the ...
In this paper, we show that for arbitrary stochastic linear dynamical systems, the problem of optimi...
We consider the problem of designing policies for Markov decision processes (MDPs) with dynamic cohe...
We study convex Constrained Markov Decision Processes (CMDPs) in which the objective is concave and ...
This dissertation studies the applicability of convex optimization to the formal verification and sy...
We present policy gradient results within the framework of linearly-solvable MDPs. For the first tim...
We investigate constrained optimal control problems for linear stochastic dyna...
We develop an approach for solving time-consistent risk-sensitive stochastic optimization problems u...
Stochastic optimization, especially multistage models, is well known to be computationally excrucia...
We propose a stochastic gradient framework for solving stochastic composite convex optimization prob...