Signal temporal logic (STL) provides a user-friendly interface for defining complex tasks for robotic systems. Recent efforts aim at designing control laws or using reinforcement learning methods to find policies which guarantee satisfaction of these tasks. While the former suffer from the trade-off between task specification and computational complexity, the latter encounter difficulties in exploration as the tasks become more complex and challenging to satisfy. This paper proposes to combine the benefits of the two approaches and use an efficient prescribed performance control (PPC) base law to guide exploration within the reinforcement learning algorithm. The potential of the method is demonstrated in a simulated environment through two ...
In this paper, we propose a method to infer temporal logic behaviour models of an a priori unknown s...
We address the problem of cooperative manipulation of an object whose tasks are specified by a Signa...
Reinforcement learning (RL) depends critically on the choice of reward functions used to capture th...
Recent advances in artificial intelligence are producing fascinating results in the field of compute...
Ensuring safety and meeting temporal specifications are critical challenges for long-term robotic ta...
Reinforcement learning has been applied to many interesting problems such as the famous TD-gammon an...
Deep Reinforcement Learning (DRL) has the potential to be used for synthesizing feedback controllers...
In this paper, we aim towards providing a practical framework for learning to satisfy signal tempora...
The increasing level of autonomy and intelligence of robotic systems in carrying out complex tasks c...
Reward engineering is an important aspect of reinforcement learning. Whether or not the users’ inten...
Reinforcement learning (RL) has been widely used, for example, in robotics, recommendation systems, ...
Reinforcement learning (RL) has been widely used, for example, in robotics, recommendation systems, ...
Motivated by the recent interest in formal methods-based control for dynamic robots, we discuss the ...
Signal temporal logic (STL) is a powerful tool for describing complex behaviors for dynamical system...
This paper addresses the problem of learning control policies for mobile robots, modeled as unknown ...
In this paper, we propose a method to infer temporal logic behaviour models of an a priori unknown s...
We address the problem of cooperative manipulation of an object whose tasks are specified by a Signa...
Reinforcement learning (RL) depends critically on the choice of reward functions used to capture th...
Recent advances in artificial intelligence are producing fascinating results in the field of compute...
Ensuring safety and meeting temporal specifications are critical challenges for long-term robotic ta...
Reinforcement learning has been applied to many interesting problems such as the famous TD-gammon an...
Deep Reinforcement Learning (DRL) has the potential to be used for synthesizing feedback controllers...
In this paper, we aim towards providing a practical framework for learning to satisfy signal tempora...
The increasing level of autonomy and intelligence of robotic systems in carrying out complex tasks c...
Reward engineering is an important aspect of reinforcement learning. Whether or not the users’ inten...
Reinforcement learning (RL) has been widely used, for example, in robotics, recommendation systems, ...
Reinforcement learning (RL) has been widely used, for example, in robotics, recommendation systems, ...
Motivated by the recent interest in formal methods-based control for dynamic robots, we discuss the ...
Signal temporal logic (STL) is a powerful tool for describing complex behaviors for dynamical system...
This paper addresses the problem of learning control policies for mobile robots, modeled as unknown ...
In this paper, we propose a method to infer temporal logic behaviour models of an a priori unknown s...
We address the problem of cooperative manipulation of an object whose tasks are specified by a Signa...
Reinforcement learning (RL) depends critically on the choice of reward functions used to capture th...