This paper proposes a high-level reinforcement learning (RL) control system for solving the action selection problem of an autonomous robot. Although the dominant approach, when using RL, has been to apply value function based algorithms, the system here detailed is characterized by the use of direct policy search methods. Rather than approximating a value function, these methodologies approximate a policy using an independent function approximator with its own parameters, trying to maximize the future expected reward. The policy based algorithm presented in this paper is used for learning the internal state/action mapping of a behavior. In this preliminary work, we demonstrate its feasibility with simulated experiments using the underwater...
Many motor skills in humanoid robotics can be learned using parametrized motor primitives. While suc...
In robotics, controllers make the robot solve a task within a specific context. The context can des...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...
Abstract—This paper proposes a high-level Reinforcement Learning (RL) control system for solving the...
Abstract: Autonomous Underwater Vehicles (AUV) represent a challenging control problem with complex,...
Autonomous underwater vehicles (AUV) represent a challenging control problem with complex, noisy, dy...
This paper proposes a field application of a high-level reinforcement learning (RL) control system f...
represent a challenging control problem with complex, noisy, dynamics. Nowadays, not only the contin...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
As most action generation problems of autonomous robots can be phrased in terms of sequential decisi...
This paper investigates how to make improved action selection for online policy learning in robotic ...
This paper describes work in progress on a neural-based reinforcement learning architecture for the ...
A summary of the state-of-the-art reinforcement learning in robotics is given, in terms of both algo...
Following the principle of human skill learning, robot acquiring skill is a process similar to human...
Reinforcement learning is a model-free technique to solve decision-making problems by learning the b...
Many motor skills in humanoid robotics can be learned using parametrized motor primitives. While suc...
In robotics, controllers make the robot solve a task within a specific context. The context can des...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...
Abstract—This paper proposes a high-level Reinforcement Learning (RL) control system for solving the...
Abstract: Autonomous Underwater Vehicles (AUV) represent a challenging control problem with complex,...
Autonomous underwater vehicles (AUV) represent a challenging control problem with complex, noisy, dy...
This paper proposes a field application of a high-level reinforcement learning (RL) control system f...
represent a challenging control problem with complex, noisy, dynamics. Nowadays, not only the contin...
General autonomy is at the forefront of robotic research and practice. Earlier research has enabled ...
As most action generation problems of autonomous robots can be phrased in terms of sequential decisi...
This paper investigates how to make improved action selection for online policy learning in robotic ...
This paper describes work in progress on a neural-based reinforcement learning architecture for the ...
A summary of the state-of-the-art reinforcement learning in robotics is given, in terms of both algo...
Following the principle of human skill learning, robot acquiring skill is a process similar to human...
Reinforcement learning is a model-free technique to solve decision-making problems by learning the b...
Many motor skills in humanoid robotics can be learned using parametrized motor primitives. While suc...
In robotics, controllers make the robot solve a task within a specific context. The context can des...
Most policy search algorithms require thousands of training episodes to find an effective policy, wh...