People learn skills by interacting with their surroundings from the time of their birth. Reinforcement learning (RL), learning a decision-making strategy (policy) to maximize a scalar reward signal by trial and error, offers such a learning paradigm to learn from surroundings. However, most of the current RL algorithms suffer from sample inefficiency: training an agent typically needs millions of samples. This thesis discusses model-based RL that is able to learn a policy to control robots from scratch with significantly fewer samples. Especially, this thesis focuses on the case where observations are high dimensional pixels. To achieve this goal, we first explain essential components to learn a latent dynamics model from high dimensi...
Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selection pol...
Behavioral control has been an effective method for controlling low-level motion for autonomous agen...
To quickly achieve good performance, reinforcement-learning algorithms for acting in large continuou...
Data-efficient reinforcement learning (RL) in continuous state-action spaces using very high-dimensi...
Data-efficient learning in continuous state-action spaces using very high-dimensional observations r...
peer reviewedWe report in this paper some positive simulation results obtained when image pixels are...
Reinforcement learning (RL) aims at autonomously performing complex tasks. To this end, a reward sig...
Reinforcement learning (RL) is a general framework for learning and evaluating intelligent behaviors...
For robots to perform tasks in the unstructured environments of the real world, they must be able to...
Reinforcement learning is a powerful approach for learning control policies that solve sequential de...
This thesis focuses on Reinforcement Learning (RL) which considers an agent that makes sequen- tial ...
There has been success in recent years for neural networks in applications requiring high level inte...
Abstract. The behavior of reinforcement learning (RL) algorithms is best understood in completely ob...
Reinforcement Learning (RL) is a popular method in machine learning. In RL, an agent learns a policy...
Training an agent to solve control tasks directly from high-dimensional images with model-free reinf...
Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selection pol...
Behavioral control has been an effective method for controlling low-level motion for autonomous agen...
To quickly achieve good performance, reinforcement-learning algorithms for acting in large continuou...
Data-efficient reinforcement learning (RL) in continuous state-action spaces using very high-dimensi...
Data-efficient learning in continuous state-action spaces using very high-dimensional observations r...
peer reviewedWe report in this paper some positive simulation results obtained when image pixels are...
Reinforcement learning (RL) aims at autonomously performing complex tasks. To this end, a reward sig...
Reinforcement learning (RL) is a general framework for learning and evaluating intelligent behaviors...
For robots to perform tasks in the unstructured environments of the real world, they must be able to...
Reinforcement learning is a powerful approach for learning control policies that solve sequential de...
This thesis focuses on Reinforcement Learning (RL) which considers an agent that makes sequen- tial ...
There has been success in recent years for neural networks in applications requiring high level inte...
Abstract. The behavior of reinforcement learning (RL) algorithms is best understood in completely ob...
Reinforcement Learning (RL) is a popular method in machine learning. In RL, an agent learns a policy...
Training an agent to solve control tasks directly from high-dimensional images with model-free reinf...
Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selection pol...
Behavioral control has been an effective method for controlling low-level motion for autonomous agen...
To quickly achieve good performance, reinforcement-learning algorithms for acting in large continuou...