In this work, we present and study a training set-up that achieves fast policy generation for real-world robotic tasks by using massive parallelism on a single workstation GPU. We analyze and discuss the impact of different training algorithm components in the massively parallel regime on the final policy performance and training times. In addition, we present a novel game-inspired curriculum that is well suited for training with thousands of simulated robots in parallel. We evaluate the approach by training the quadrupedal robot ANYmal to walk on challenging terrain. The parallel approach allows training policies for flat terrain in under four minutes, and in twenty minutes for uneven terrain. This represents a speedup of multiple orders o...
Numerous algorithms have been proposed to allow legged robots to learn to walk. However, their vast ...
Conventionally programmed systems (e.g. robots) are not able to adapt to unforeseen changes in their...
Abstract — We present a learning system which is able to quickly and reliably acquire a robust feedb...
Reinforcement learning is an important family of algo-rithms that have been extremely effective in f...
For a robot, the ability to get from one place to another is one of the most basic skills. However, ...
Ambulation is a valuable form of locomotion for robots which must operate in spaces designed for hum...
Deep Reinforcement Learning (DRL) has been applied successfully to many robotic applications. Howeve...
This thesis studies the broad problem of learning robust control policies for difficult physics-base...
Reinforcement learning (RL) is an agent-based approach for teaching robots to navigate within the ph...
Building controllers for legged robots with agility and intelligence has been one of the typical cha...
Programming robots for performing different activities requires calculating sequences of values of t...
Reliable bipedal walking over complex terrain is a challenging problem, using a curriculum can help ...
In this paper, we explore a specific form of deep reinforcement learning (D-RL) technique for quadru...
Service robots have the potential to be of great value in households, health care and other labor in...
Deep reinforcement learning (DRL) offers a promising approach for the synthesis of control policies ...
Numerous algorithms have been proposed to allow legged robots to learn to walk. However, their vast ...
Conventionally programmed systems (e.g. robots) are not able to adapt to unforeseen changes in their...
Abstract — We present a learning system which is able to quickly and reliably acquire a robust feedb...
Reinforcement learning is an important family of algo-rithms that have been extremely effective in f...
For a robot, the ability to get from one place to another is one of the most basic skills. However, ...
Ambulation is a valuable form of locomotion for robots which must operate in spaces designed for hum...
Deep Reinforcement Learning (DRL) has been applied successfully to many robotic applications. Howeve...
This thesis studies the broad problem of learning robust control policies for difficult physics-base...
Reinforcement learning (RL) is an agent-based approach for teaching robots to navigate within the ph...
Building controllers for legged robots with agility and intelligence has been one of the typical cha...
Programming robots for performing different activities requires calculating sequences of values of t...
Reliable bipedal walking over complex terrain is a challenging problem, using a curriculum can help ...
In this paper, we explore a specific form of deep reinforcement learning (D-RL) technique for quadru...
Service robots have the potential to be of great value in households, health care and other labor in...
Deep reinforcement learning (DRL) offers a promising approach for the synthesis of control policies ...
Numerous algorithms have been proposed to allow legged robots to learn to walk. However, their vast ...
Conventionally programmed systems (e.g. robots) are not able to adapt to unforeseen changes in their...
Abstract — We present a learning system which is able to quickly and reliably acquire a robust feedb...