We develop a framework for convexifying a fairly general class of optimization problems. Under additional assumptions, we analyze the suboptimality of the solution to the con-vexified problem relative to the original non-convex problem and prove additive approx-imation guarantees. We then develop algo-rithms based on stochastic gradient methods to solve the resulting optimization problems and show bounds on convergence rates. We then extend this framework to apply to a gen-eral class of discrete-time dynamical systems. In this context, our convexification approach falls under the well-studied paradigm of risk-sensitive Markov Decision Processes. We de-rive the first known model-based and model-free policy gradient optimization algorithms wi...
Stochastic optimization, especially multistage models, is well known to be computationally excru-cia...
Abstract. Linear optimization is many times algorithmically simpler than non-linear convex optimizat...
In this work, we study discrete-time Markov decision processes (MDPs) under constraints with Borel s...
We develop a framework for convexifying a fairly general class of optimization problems. Under addit...
We study convex Constrained Markov Decision Processes (CMDPs) in which the objective is concave and ...
We consider the problem of designing policies for Markov decision processes (MDPs) with dynamic cohe...
In this paper, we show that for arbitrary stochastic linear dynamical systems, the problem of optimi...
International audienceWe investigate constrained optimal control problems for linear stochastic dyna...
We study convex Constrained Markov Decision Processes (CMDPs) in which the objective is concave and ...
This dissertation studies the applicability of convex optimization to the formal verification and sy...
Abstract — In this paper, we show that for arbitrary stochastic linear dynamical systems, the proble...
We propose a stochastic gradient framework for solving stochastic composite convex optimization prob...
We present policy gradient results within the framework of linearly-solvable MDPs. For the first tim...
We develop an approach for solving time-consistent risk-sensitive stochastic optimization problems u...
We analyze the global and local behavior of gradient-like flows under stochastic errors towards the ...
Stochastic optimization, especially multistage models, is well known to be computationally excru-cia...
Abstract. Linear optimization is many times algorithmically simpler than non-linear convex optimizat...
In this work, we study discrete-time Markov decision processes (MDPs) under constraints with Borel s...
We develop a framework for convexifying a fairly general class of optimization problems. Under addit...
We study convex Constrained Markov Decision Processes (CMDPs) in which the objective is concave and ...
We consider the problem of designing policies for Markov decision processes (MDPs) with dynamic cohe...
In this paper, we show that for arbitrary stochastic linear dynamical systems, the problem of optimi...
International audienceWe investigate constrained optimal control problems for linear stochastic dyna...
We study convex Constrained Markov Decision Processes (CMDPs) in which the objective is concave and ...
This dissertation studies the applicability of convex optimization to the formal verification and sy...
Abstract — In this paper, we show that for arbitrary stochastic linear dynamical systems, the proble...
We propose a stochastic gradient framework for solving stochastic composite convex optimization prob...
We present policy gradient results within the framework of linearly-solvable MDPs. For the first tim...
We develop an approach for solving time-consistent risk-sensitive stochastic optimization problems u...
We analyze the global and local behavior of gradient-like flows under stochastic errors towards the ...
Stochastic optimization, especially multistage models, is well known to be computationally excru-cia...
Abstract. Linear optimization is many times algorithmically simpler than non-linear convex optimizat...
In this work, we study discrete-time Markov decision processes (MDPs) under constraints with Borel s...