2017 Summer.Includes bibliographical references.This dissertation presents knowledge acquisition and retention methods for efficient and robust learning. We propose a framework for learning and memorizing, and we examine how we can use the memory for efficient machine learning. Temporal difference (TD) learning is a core part of reinforcement learning, and it requires function approximation. However, with function approximation, the most popular TD methods such as TD(λ), SARSA, and Q-learning lose stability and diverge especially when the complexity of the problem grows and the sampling distribution is biased. The biased samples cause function approximators such as neural networks to respond quickly to the new data by losing what was previo...
Reinforcement learning is a powerful approach for learning control policies that solve sequential de...
Most provably-efficient reinforcement learning algorithms introduce opti-mism about poorly-understoo...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selection pol...
Treballs Finals de Grau de Física, Facultat de Física, Universitat de Barcelona, Any: 2015, Tutor: P...
We present a Bayesian reinforcement learning model with a working memory module which can solve some...
We address the problem of Bayesian reinforcement learning using efficient model-based online plannin...
We investigate sparse representations for control in reinforcement learning. While these representat...
Reinforcement learning agents can successfully learn in a variety of difficult tasks. A fundamental ...
One of the most challenging types of environments for a Deep Reinforcement Learning agent to learn i...
A key aspect of artificial intelligence is the ability to learn from experience. If examples of corr...
International audienceA wide variety of function approximation schemes have been applied to reinforc...
In this master thesis, we have tried to solve two of most prominent Reinforcement Learning problems:...
In many complex robot applications, such as grasping and manipulation, it is difficult to program de...
My research attempts to address on-line action selection in reinforcement learning from a Bayesian p...
Reinforcement learning is a powerful approach for learning control policies that solve sequential de...
Most provably-efficient reinforcement learning algorithms introduce opti-mism about poorly-understoo...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
Reinforcement Learning (RL) algorithms allow artificial agents to improve their action selection pol...
Treballs Finals de Grau de Física, Facultat de Física, Universitat de Barcelona, Any: 2015, Tutor: P...
We present a Bayesian reinforcement learning model with a working memory module which can solve some...
We address the problem of Bayesian reinforcement learning using efficient model-based online plannin...
We investigate sparse representations for control in reinforcement learning. While these representat...
Reinforcement learning agents can successfully learn in a variety of difficult tasks. A fundamental ...
One of the most challenging types of environments for a Deep Reinforcement Learning agent to learn i...
A key aspect of artificial intelligence is the ability to learn from experience. If examples of corr...
International audienceA wide variety of function approximation schemes have been applied to reinforc...
In this master thesis, we have tried to solve two of most prominent Reinforcement Learning problems:...
In many complex robot applications, such as grasping and manipulation, it is difficult to program de...
My research attempts to address on-line action selection in reinforcement learning from a Bayesian p...
Reinforcement learning is a powerful approach for learning control policies that solve sequential de...
Most provably-efficient reinforcement learning algorithms introduce opti-mism about poorly-understoo...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...