We consider the problem of learning control policies in discrete-time stochastic systems which guarantee that the system stabilizes within some specified stabilization region with probability~$1$. Our approach is based on the novel notion of stabilizing ranking supermartingales (sRSMs) that we introduce in this work. Our sRSMs overcome the limitation of methods proposed in previous works whose applicability is restricted to systems in which the stabilizing region cannot be left once entered under any control policy. We present a learning procedure that learns a control policy together with an sRSM that formally certifies probability~$1$ stability, both learned as neural networks. We show that this procedure can also be adapted to formally v...
This paper addresses stochastic stabilization in case where implementation of control policies is di...
We propose a certainty-equivalence scheme for adaptive control of scalar linear systems subject to a...
We propose a stochastic MPC scheme using an optimization over the initial state for the predicted tr...
We consider the problem of learning control policies in discrete-time stochastic systems which guara...
We consider the problem of learning control policies in discrete-time stochastic systems which guara...
We consider the problem of formally verifying almost-sure (a.s.) asymptotic stability in discrete-ti...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
In this work, we address the problem of learning provably stable neural network policies for stochas...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
The problem of synthesizing stochastic explicit model predictive control policies is known to be qui...
In this work, the model predictive control problem is extended to include not only open-loop control...
We deal with nonlinear dynamical systems, consisting of a linear nominal part perturbed by model unc...
A random discrete-time system {xn}, N = 0, 1, 2, ... is called stochastically stable if for every [e...
AbstractA random discrete-time system {xn}, n = 0, 1, 2, … is called stochastically stable if for ev...
This paper addresses stochastic stabilization in case where implementation of control policies is di...
We propose a certainty-equivalence scheme for adaptive control of scalar linear systems subject to a...
We propose a stochastic MPC scheme using an optimization over the initial state for the predicted tr...
We consider the problem of learning control policies in discrete-time stochastic systems which guara...
We consider the problem of learning control policies in discrete-time stochastic systems which guara...
We consider the problem of formally verifying almost-sure (a.s.) asymptotic stability in discrete-ti...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
In this work, we address the problem of learning provably stable neural network policies for stochas...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
The problem of synthesizing stochastic explicit model predictive control policies is known to be qui...
In this work, the model predictive control problem is extended to include not only open-loop control...
We deal with nonlinear dynamical systems, consisting of a linear nominal part perturbed by model unc...
A random discrete-time system {xn}, N = 0, 1, 2, ... is called stochastically stable if for every [e...
AbstractA random discrete-time system {xn}, n = 0, 1, 2, … is called stochastically stable if for ev...
This paper addresses stochastic stabilization in case where implementation of control policies is di...
We propose a certainty-equivalence scheme for adaptive control of scalar linear systems subject to a...
We propose a stochastic MPC scheme using an optimization over the initial state for the predicted tr...