Coordination of multiple behaviors independently ob-tained by a reinforcement learning method is one of the issues in order for the method to be scaled to larg-er and more complex robot learning tasks. Direct com-bination of all the state spaces for individual modules (subtasks) needs enormous learning time, and it causes hidden states. This paper presents a method of modu-lar learning which coordinates multiple behaviors tak-ing account of a trade-off between learning time and performance. First, in order to reduce the learning time the whole state space is classified into two cate-gories based on the action values separately obtained by Q learning: the area where one of the learned behav-iors is directly applicable (no more learning area)...
Abstract—Life-time development of behavior learning seems based on not only self-learning architectu...
Abstract: This paper focuses on two issues on learning and development; a problem of state-action sp...
This paper proposes an efficient method of robot learning by which a set of pairs of a state and an ...
This paper proposes a method that acquires the pur-posive behaviors based on the estimation of the s...
This paper proposes an adaptive modular reinforcement learning architecture and an algorithm for rob...
The existing reinforcement learning methods have been seriously suffering from the curse of dimensio...
Abstract. The behavior of reinforcement learning (RL) algorithms is best understood in completely ob...
A Policy Gradient Reinforcement Learning (RL) technique is used to design the low level controllers ...
Abstract — In this paper, we propose a reinforcement learning approach to address multi-robot cooper...
This paper describes a reinforcement connec-tionist learning mechanism that allows a goal-directed a...
Designing distributed controllers for self-reconfiguring modular ro-bots has been consistently chall...
A fundamental challenge in robotics is controller design. While designing a robot\u27s individual be...
This paper presents an experimental investigation about Reinforcement Learning of multiple reactive ...
International audienceWithin this paper, a new kind of learning agents - so-called Constraint based ...
This paper focuses on two issues on learning and development; a problem of state-action space con-st...
Abstract—Life-time development of behavior learning seems based on not only self-learning architectu...
Abstract: This paper focuses on two issues on learning and development; a problem of state-action sp...
This paper proposes an efficient method of robot learning by which a set of pairs of a state and an ...
This paper proposes a method that acquires the pur-posive behaviors based on the estimation of the s...
This paper proposes an adaptive modular reinforcement learning architecture and an algorithm for rob...
The existing reinforcement learning methods have been seriously suffering from the curse of dimensio...
Abstract. The behavior of reinforcement learning (RL) algorithms is best understood in completely ob...
A Policy Gradient Reinforcement Learning (RL) technique is used to design the low level controllers ...
Abstract — In this paper, we propose a reinforcement learning approach to address multi-robot cooper...
This paper describes a reinforcement connec-tionist learning mechanism that allows a goal-directed a...
Designing distributed controllers for self-reconfiguring modular ro-bots has been consistently chall...
A fundamental challenge in robotics is controller design. While designing a robot\u27s individual be...
This paper presents an experimental investigation about Reinforcement Learning of multiple reactive ...
International audienceWithin this paper, a new kind of learning agents - so-called Constraint based ...
This paper focuses on two issues on learning and development; a problem of state-action space con-st...
Abstract—Life-time development of behavior learning seems based on not only self-learning architectu...
Abstract: This paper focuses on two issues on learning and development; a problem of state-action sp...
This paper proposes an efficient method of robot learning by which a set of pairs of a state and an ...