We consider the problem of learning control policies in discrete-time stochastic systems which guarantee that the system stabilizes within some specified stabilization region with probability 1. Our approach is based on the novel notion of stabilizing ranking supermartingales (sRSMs) that we introduce in this work. Our sRSMs overcome the limitation of methods proposed in previous works whose applicability is restricted to systems in which the stabilizing region cannot be left once entered under any control policy. We present a learning procedure that learns a control policy together with an sRSM that formally certifies probability 1 stability, both learned as neural networks. We show that this procedure can also be adapted to formally verif...
This paper considers a class of reinforcement-learning that belongs to the family of Learning Automa...
We deal with nonlinear dynamical systems, consisting of a linear nominal part plus model uncertainti...
This paper considers a class of reinforcement-based learning (namely, perturbed learning automata) a...
We consider the problem of learning control policies in discrete-time stochastic systems which guara...
We consider the problem of formally verifying almost-sure (a.s.) asymptotic stability in discrete-ti...
In this work, we address the problem of learning provably stable neural network policies for stochas...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
We deal with nonlinear dynamical systems, consisting of a linear nominal part perturbed by model unc...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
The problem of synthesizing stochastic explicit model predictive control policies is known to be qui...
A simulation-based algorithm for learning good policies for a discrete-time stochastic control proce...
In this work, the model predictive control problem is extended to include not only open-loop control...
We are interested in understanding stability (almost sure boundedness) of stochastic approximation a...
The paper studies a stochastic extension of continuous recurrent neural networks and analyzes gradie...
This paper considers a class of reinforcement-learning that belongs to the family of Learning Automa...
We deal with nonlinear dynamical systems, consisting of a linear nominal part plus model uncertainti...
This paper considers a class of reinforcement-based learning (namely, perturbed learning automata) a...
We consider the problem of learning control policies in discrete-time stochastic systems which guara...
We consider the problem of formally verifying almost-sure (a.s.) asymptotic stability in discrete-ti...
In this work, we address the problem of learning provably stable neural network policies for stochas...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
We deal with nonlinear dynamical systems, consisting of a linear nominal part perturbed by model unc...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
The problem of synthesizing stochastic explicit model predictive control policies is known to be qui...
A simulation-based algorithm for learning good policies for a discrete-time stochastic control proce...
In this work, the model predictive control problem is extended to include not only open-loop control...
We are interested in understanding stability (almost sure boundedness) of stochastic approximation a...
The paper studies a stochastic extension of continuous recurrent neural networks and analyzes gradie...
This paper considers a class of reinforcement-learning that belongs to the family of Learning Automa...
We deal with nonlinear dynamical systems, consisting of a linear nominal part plus model uncertainti...
This paper considers a class of reinforcement-based learning (namely, perturbed learning automata) a...