This paper proposes a relaxed control regularization with general exploration rewards to design robust feedback controls for multidimensional continuous-time stochastic exit time problems. We establish that the regularized control problem admits a Hölder continuous optimal feedback control and demonstrate that both the value function and the feedback control of the regularized control problem are Lipschitz stable with respect to parameter perturbations. Moreover, we show that a precomputed feedback relaxed control gives a robust performance in a perturbed system and derive a first-order sensitivity equation for both the value function and optimal feedback relaxed control. These stability results provide a theoretical justification for recen...
In this thesis, we investigate the linear programming framework for exit-time stochastic control pro...
Abstract—We present a reformulation of the stochastic optimal control problem in terms of KL diverge...
In this thesis, we study the related problems of reinforcement learning and optimal adaptive control...
Trajectory-Centric Reinforcement Learning and Trajectory Optimization methods optimize a sequence of...
This paper proposes a new regularization technique for reinforcement learning (RL) towards making po...
For a general entropy-regularized stochastic control problem on an infinite horizon, we prove that a...
Dynamic programming is a principal method for analyzing stochastic optimal control problems. However...
We consider a nonlinear control system involving a maximal monotone map and with a priori feedback. ...
Reinforcement learning (RL) is promising for complicated stochastic nonlinear control problems. With...
In this paper, we study the optimal stopping problem in the so-called exploratory framework, in whic...
Despite its popularity in the reinforcement learning community, a provably convergent policy gradien...
The linear programming (LP) approach has a long history in the theory of approximate dynamic program...
Abstract. We investigate a class of reinforcement learning dynamics in which each player plays a “re...
This thesis investigates several topics involving robust control of stochastic nonlinear systems. Fi...
Stochastic optimal control studies the problem of sequential decision-making under uncertainty. Dyna...
In this thesis, we investigate the linear programming framework for exit-time stochastic control pro...
Abstract—We present a reformulation of the stochastic optimal control problem in terms of KL diverge...
In this thesis, we study the related problems of reinforcement learning and optimal adaptive control...
Trajectory-Centric Reinforcement Learning and Trajectory Optimization methods optimize a sequence of...
This paper proposes a new regularization technique for reinforcement learning (RL) towards making po...
For a general entropy-regularized stochastic control problem on an infinite horizon, we prove that a...
Dynamic programming is a principal method for analyzing stochastic optimal control problems. However...
We consider a nonlinear control system involving a maximal monotone map and with a priori feedback. ...
Reinforcement learning (RL) is promising for complicated stochastic nonlinear control problems. With...
In this paper, we study the optimal stopping problem in the so-called exploratory framework, in whic...
Despite its popularity in the reinforcement learning community, a provably convergent policy gradien...
The linear programming (LP) approach has a long history in the theory of approximate dynamic program...
Abstract. We investigate a class of reinforcement learning dynamics in which each player plays a “re...
This thesis investigates several topics involving robust control of stochastic nonlinear systems. Fi...
Stochastic optimal control studies the problem of sequential decision-making under uncertainty. Dyna...
In this thesis, we investigate the linear programming framework for exit-time stochastic control pro...
Abstract—We present a reformulation of the stochastic optimal control problem in terms of KL diverge...
In this thesis, we study the related problems of reinforcement learning and optimal adaptive control...