While reinforcement learning (RL) methods that learn an internal model of the environment have the potential to be more sample efficient than their model-free counterparts, learning to model raw observations from high dimensional sensors can be challenging. Prior work has addressed this challenge by learning low-dimensional representation of observations through auxiliary objectives, such as reconstruction or value prediction. However, the alignment between these auxiliary objectives and the RL objective is often unclear. In this work, we propose a single objective which jointly optimizes a latent-space model and policy to achieve high returns while remaining self-consistent. This objective is a lower bound on expected returns. Unlike prior...
Humans can develop their internal model of the external world and use it for decision making. Reinfo...
The impetus for exploration in reinforcement learning (RL) is decreasing uncertainty about the envir...
Visual model-based RL methods typically encode image observations into low-dimensional representatio...
We study reinforcement learning (RL) in settings where observations are high-dimensional, but where ...
Many methods for Model-based Reinforcement learning (MBRL) in Markov decision processes (MDPs) provi...
Reinforcement Learning (RL) can enable agents to learn complex tasks. However, it is difficult to in...
Achieving sample efficiency in online episodic reinforcement learning (RL) requires optimally balanc...
Reinforcement learning (RL) algorithms have proven transformative in a range of domains. To tackle r...
Model-based reinforcement learning (MBRL) is believed to have much higher sample efficiency compared...
Reinforcement learning (RL) is a machine learning paradigm where an agent learns to interact with an...
Existing offline reinforcement learning (RL) algorithms typically assume that training data is eithe...
In supervised deep learning, learning good representations for remote--sensing images (RSI) relies o...
In this master thesis, we have tried to solve two of most prominent Reinforcement Learning problems:...
Training an agent to solve control tasks directly from high-dimensional images with model-free reinf...
Reinforcement learning (RL) is a general framework for learning and evaluating intelligent behaviors...
Humans can develop their internal model of the external world and use it for decision making. Reinfo...
The impetus for exploration in reinforcement learning (RL) is decreasing uncertainty about the envir...
Visual model-based RL methods typically encode image observations into low-dimensional representatio...
We study reinforcement learning (RL) in settings where observations are high-dimensional, but where ...
Many methods for Model-based Reinforcement learning (MBRL) in Markov decision processes (MDPs) provi...
Reinforcement Learning (RL) can enable agents to learn complex tasks. However, it is difficult to in...
Achieving sample efficiency in online episodic reinforcement learning (RL) requires optimally balanc...
Reinforcement learning (RL) algorithms have proven transformative in a range of domains. To tackle r...
Model-based reinforcement learning (MBRL) is believed to have much higher sample efficiency compared...
Reinforcement learning (RL) is a machine learning paradigm where an agent learns to interact with an...
Existing offline reinforcement learning (RL) algorithms typically assume that training data is eithe...
In supervised deep learning, learning good representations for remote--sensing images (RSI) relies o...
In this master thesis, we have tried to solve two of most prominent Reinforcement Learning problems:...
Training an agent to solve control tasks directly from high-dimensional images with model-free reinf...
Reinforcement learning (RL) is a general framework for learning and evaluating intelligent behaviors...
Humans can develop their internal model of the external world and use it for decision making. Reinfo...
The impetus for exploration in reinforcement learning (RL) is decreasing uncertainty about the envir...
Visual model-based RL methods typically encode image observations into low-dimensional representatio...