Lyapunov design methods are used widely in control engineering to design controllers that achieve qualitative objectives, such as stabilizing a system or maintaining a system’s state in a desired operating range. We propose a method for constructing safe, reliable reinforcement learning agents based on Lyapunov design principles. In our approach, an agent learns to control a system by switching among a number of given, base-level controllers. These controllers are designed using Lyapunov domain knowledge so that any switching policy is safe and enjoys basic performance guarantees. Our approach thus ensures qualitatively satisfactory agent behavior for virtually any reinforcement learning algorithm and at all times, including while the agent...
In safety-critical applications, autonomous agents may need to learn in an environment where mistake...
Control systems designed via learning methods, aiming at quasi-optimal solutions, typically lack sta...
Reinforcement Learning (RL) algorithms have found limited success beyond simulated applications, and...
In the many successful applications of artificial intelligence (AI) methods to real-world problems i...
This paper presents a safe learning strategy for continuous state and action spaces by utilizing Lya...
Real-world autonomous systems are often controlled using conventional model-based control methods. B...
Real-world autonomous systems are often controlled using conventional model-based control methods. B...
Model-free reinforcement learning has proved to be successful in many tasks such as robotic manipula...
Reinforcement learning (RL) is promising for complicated stochastic nonlinear control problems. With...
Reinforcement learning (RL) is promising for complicated stochastic nonlinear control problems. With...
Reinforcement learning (RL) is promising for complicated stochastic nonlinear control problems. With...
Real-world autonomous systems are often controlled using conventional model-based control methods. B...
This paper proposes an on-policy reinforcement learning (RL) control algorithm that solves the optim...
Formal verification provides a high degree of confidence in safe system operation, but only if reali...
Control systems designed via learning methods, aiming at quasi-optimal solutions, typically lack sta...
In safety-critical applications, autonomous agents may need to learn in an environment where mistake...
Control systems designed via learning methods, aiming at quasi-optimal solutions, typically lack sta...
Reinforcement Learning (RL) algorithms have found limited success beyond simulated applications, and...
In the many successful applications of artificial intelligence (AI) methods to real-world problems i...
This paper presents a safe learning strategy for continuous state and action spaces by utilizing Lya...
Real-world autonomous systems are often controlled using conventional model-based control methods. B...
Real-world autonomous systems are often controlled using conventional model-based control methods. B...
Model-free reinforcement learning has proved to be successful in many tasks such as robotic manipula...
Reinforcement learning (RL) is promising for complicated stochastic nonlinear control problems. With...
Reinforcement learning (RL) is promising for complicated stochastic nonlinear control problems. With...
Reinforcement learning (RL) is promising for complicated stochastic nonlinear control problems. With...
Real-world autonomous systems are often controlled using conventional model-based control methods. B...
This paper proposes an on-policy reinforcement learning (RL) control algorithm that solves the optim...
Formal verification provides a high degree of confidence in safe system operation, but only if reali...
Control systems designed via learning methods, aiming at quasi-optimal solutions, typically lack sta...
In safety-critical applications, autonomous agents may need to learn in an environment where mistake...
Control systems designed via learning methods, aiming at quasi-optimal solutions, typically lack sta...
Reinforcement Learning (RL) algorithms have found limited success beyond simulated applications, and...