We study the problem of learning controllers for discrete-time non-linear stochastic dynamical systems with formal reach-avoid guarantees. This work presents the first method for providing formal reach-avoid guarantees, which combine and generalize stability and safety guarantees, with a tolerable probability threshold $p\in[0,1]$ over the infinite time horizon. Our method leverages advances in machine learning literature and it represents formal certificates as neural networks. In particular, we learn a certificate in the form of a reach-avoid supermartingale (RASM), a novel notion that we introduce in this work. Our RASMs provide reachability and avoidance guarantees by imposing constraints on what can be viewed as a stochastic extension ...
Constrained reinforcement learning (CRL) has gained significant interest recently, since safety cons...
We study the problem of \textit{safe control of linear dynamical systems corrupted with non-stochast...
In this work, the model predictive control problem is extended to include not only open-loop control...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
We consider the problem of learning control policies in discrete-time stochastic systems which guara...
We consider the problem of learning control policies in discrete-time stochastic systems which guara...
Stability and safety are crucial in safety-critical control of dynamical systems. The reach-avoid-st...
We consider the problem of learning control policies in discrete-time stochastic systems which guara...
Reinforcement learning (RL) is a promising approach. However, success is limited to real-world appli...
This paper presents an end-to-end framework for safe learning-based control (LbC) using nonlinear st...
This paper presents an end-to-end framework for safe learning-based control (LbC) using nonlinear st...
This paper develops a model-based reinforcement learning (MBRL) framework for learning online the va...
It is quite challenging to ensure the safety of reinforcement learning (RL) agents in an unknown and...
Constrained reinforcement learning (CRL) has gained significant interest recently, since safety cons...
We study the problem of \textit{safe control of linear dynamical systems corrupted with non-stochast...
In this work, the model predictive control problem is extended to include not only open-loop control...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
We consider the problem of learning control policies in discrete-time stochastic systems which guara...
We consider the problem of learning control policies in discrete-time stochastic systems which guara...
Stability and safety are crucial in safety-critical control of dynamical systems. The reach-avoid-st...
We consider the problem of learning control policies in discrete-time stochastic systems which guara...
Reinforcement learning (RL) is a promising approach. However, success is limited to real-world appli...
This paper presents an end-to-end framework for safe learning-based control (LbC) using nonlinear st...
This paper presents an end-to-end framework for safe learning-based control (LbC) using nonlinear st...
This paper develops a model-based reinforcement learning (MBRL) framework for learning online the va...
It is quite challenging to ensure the safety of reinforcement learning (RL) agents in an unknown and...
Constrained reinforcement learning (CRL) has gained significant interest recently, since safety cons...
We study the problem of \textit{safe control of linear dynamical systems corrupted with non-stochast...
In this work, the model predictive control problem is extended to include not only open-loop control...