Pure reinforcement learning does not scale well to domains with many degrees of freedom and particularly to continuous domains. In this thesis, we introduce a hybrid method in which a symbolic planner constructs all approximate solution to a control problem.. Subsequently, a numerical optimisation algorithm is used to refine the qualitative plan into an operational policy. The method is demonstrated on the problem of learning a stable walking gait for a bipedal robot.The contributions of this thesis are as follows. Firstly, the thesis proposes a novel way to generate gait patterns by using a genetic algorithm to generate walking gaits for a humanoid robot using zero moment point as the stability criterion. This is validated on physical rob...
Due to copyright restrictions, the access to the full text of this article is only available via sub...
This paper presents a method for learning the parameters of rhythmic walking to generate a purposive...
Gait learning is usually under a so-called simulation based framework, where a simulation platform i...
AbstractThis paper demonstrates application of Reinforcement Learning to optimization of control of ...
In ZMP trajectory generation using simple models, often a considerable amount of trials and errors a...
Bipedal walking is a challenging task for humanoid robots. In this study, we develop a lightweight r...
Locomotion control has long been vital to legged robots. Agile locomotion can be implemented through...
We investigate learning of flexible robot locomotion controllers, i.e., the controllers should be ap...
In this paper, we proposed a novel Hybrid Reinforcement Learning framework to maintain the stability...
This paper presents a method for learning the parameters of rhythmic walking to generate pur-posive ...
Abstract. This paper presents a novel dynamic control approach to acquire biped walking of humanoid ...
Programming robots for performing different activities requires calculating sequences of values of t...
Abstract—In the field of robotics there is a great interest in developing strategies and algorithms ...
Reinforcement learning provides a general framework for achieving autonomy and diversity in traditio...
Robots hold the promise of becoming useful helpers for many dangerous, laborious, or unpleasant task...
Due to copyright restrictions, the access to the full text of this article is only available via sub...
This paper presents a method for learning the parameters of rhythmic walking to generate a purposive...
Gait learning is usually under a so-called simulation based framework, where a simulation platform i...
AbstractThis paper demonstrates application of Reinforcement Learning to optimization of control of ...
In ZMP trajectory generation using simple models, often a considerable amount of trials and errors a...
Bipedal walking is a challenging task for humanoid robots. In this study, we develop a lightweight r...
Locomotion control has long been vital to legged robots. Agile locomotion can be implemented through...
We investigate learning of flexible robot locomotion controllers, i.e., the controllers should be ap...
In this paper, we proposed a novel Hybrid Reinforcement Learning framework to maintain the stability...
This paper presents a method for learning the parameters of rhythmic walking to generate pur-posive ...
Abstract. This paper presents a novel dynamic control approach to acquire biped walking of humanoid ...
Programming robots for performing different activities requires calculating sequences of values of t...
Abstract—In the field of robotics there is a great interest in developing strategies and algorithms ...
Reinforcement learning provides a general framework for achieving autonomy and diversity in traditio...
Robots hold the promise of becoming useful helpers for many dangerous, laborious, or unpleasant task...
Due to copyright restrictions, the access to the full text of this article is only available via sub...
This paper presents a method for learning the parameters of rhythmic walking to generate a purposive...
Gait learning is usually under a so-called simulation based framework, where a simulation platform i...