Synthesis from linear temporal logic (LTL) specifications provides assured controllers for systems operating in stochastic and potentially adversarial environments. Automatic synthesis tools, however, require a model of the environment to construct controllers. In this work, we introduce a model-free reinforcement learning (RL) approach to derive controllers from given LTL specifications even when the environment is completely unknown. We model the problem as a stochastic game (SG) between the controller and the adversarial environment; we then learn optimal control strategies that maximize the probability of satisfying the LTL specifications against the worst-case environment behavior. We first construct a product game using the determinis...
Abstract—We consider synthesis of control policies that maxi-mize the probability of satisfying give...
This paper presents an algorithmic framework for control synthesis of continuous dynamical systems s...
Abstract—We consider synthesis of controllers that maximize the probability of satisfying given temp...
This paper addresses the problem of learning control policies for mobile robots, modeled as unknown ...
We present a model-free reinforcement learning algorithm to synthesize control policies that maximiz...
We present a model-free reinforcement learning algorithm to synthesize control policies that maximiz...
We propose to synthesize a control policy for a Markov decision process (MDP) such that the resultin...
Abstract. We study strategy synthesis for stochastic two-player games with multiple objectives expre...
In recent years, researchers have made significant progress in devising reinforcement-learning algor...
Abstract — We propose to synthesize a control policy for a Markov decision process (MDP) such that t...
Reinforcement Learning (RL) is a widely employed machine learning architecture that has been applied...
AbstractDesign and control of computer systems that operate in uncertain, competitive or adversarial...
In this paper, we aim towards providing a practical framework for learning to satisfy signal tempora...
In this paper, we aim towards providing a practical framework for learning to satisfy signal tempora...
Reactive synthesis algorithms allow automatic construction of policies to control an environment mod...
Abstract—We consider synthesis of control policies that maxi-mize the probability of satisfying give...
This paper presents an algorithmic framework for control synthesis of continuous dynamical systems s...
Abstract—We consider synthesis of controllers that maximize the probability of satisfying given temp...
This paper addresses the problem of learning control policies for mobile robots, modeled as unknown ...
We present a model-free reinforcement learning algorithm to synthesize control policies that maximiz...
We present a model-free reinforcement learning algorithm to synthesize control policies that maximiz...
We propose to synthesize a control policy for a Markov decision process (MDP) such that the resultin...
Abstract. We study strategy synthesis for stochastic two-player games with multiple objectives expre...
In recent years, researchers have made significant progress in devising reinforcement-learning algor...
Abstract — We propose to synthesize a control policy for a Markov decision process (MDP) such that t...
Reinforcement Learning (RL) is a widely employed machine learning architecture that has been applied...
AbstractDesign and control of computer systems that operate in uncertain, competitive or adversarial...
In this paper, we aim towards providing a practical framework for learning to satisfy signal tempora...
In this paper, we aim towards providing a practical framework for learning to satisfy signal tempora...
Reactive synthesis algorithms allow automatic construction of policies to control an environment mod...
Abstract—We consider synthesis of control policies that maxi-mize the probability of satisfying give...
This paper presents an algorithmic framework for control synthesis of continuous dynamical systems s...
Abstract—We consider synthesis of controllers that maximize the probability of satisfying given temp...