In the quest for efficient and robust reinforcement learning methods, both model-free and model-based approaches offer advantages. In this paper we propose a new way of explicitly bridging both approaches via a shared low-dimensional learned encoding of the environment, meant to capture summarizing abstractions. We show that the modularity brought by this approach leads to good generalization while being computationally efficient, with planning happening in a smaller latent state space. In addition, this approach recovers a sufficient low-dimensional representation of the environment, which opens up new strategies for interpretable AI, exploration and transfer learning
In this paper we present a new method for reinforcement learning in relational domains. A logical la...
Reinforcement learning (RL) provides a general framework for data-driven decision making. However, t...
Deep reinforcement learning methods are capable of learning complex heuristics starting with no prio...
We introduce an algorithm for model-based hierarchical reinforcement learning to acquire self-contai...
Reinforcement learning (RL) models the learning process of humans, but as exciting advances are made...
Agents (humans, mice, computers) need to constantly make decisions to survive and thrive in their e...
Reinforcement learning has long been advertised as the one with the capability to intelligently mimi...
Presented online via Bluejeans Events on September 15, 2021 at 12:15 p.m.Alekh Agarwal is a research...
We are interested in the following general question: is it pos-\ud sible to abstract knowledge that ...
To operate effectively in complex environments learning agents require the ability to form useful ab...
Research in Artificial Intelligence (AI) has focused mostly on two extremes: either on small improve...
Realistic domains for learning possess regularities that make it possible to generalize experience a...
Reinforcement learning presents a challenging problem: agents must generalize experiences, efficient...
This paper introduces a novel approach for abstraction selection in reinforcement learning problems ...
The application of reinforcement learning (RL) algorithms is often hindered by the combinatorial exp...
In this paper we present a new method for reinforcement learning in relational domains. A logical la...
Reinforcement learning (RL) provides a general framework for data-driven decision making. However, t...
Deep reinforcement learning methods are capable of learning complex heuristics starting with no prio...
We introduce an algorithm for model-based hierarchical reinforcement learning to acquire self-contai...
Reinforcement learning (RL) models the learning process of humans, but as exciting advances are made...
Agents (humans, mice, computers) need to constantly make decisions to survive and thrive in their e...
Reinforcement learning has long been advertised as the one with the capability to intelligently mimi...
Presented online via Bluejeans Events on September 15, 2021 at 12:15 p.m.Alekh Agarwal is a research...
We are interested in the following general question: is it pos-\ud sible to abstract knowledge that ...
To operate effectively in complex environments learning agents require the ability to form useful ab...
Research in Artificial Intelligence (AI) has focused mostly on two extremes: either on small improve...
Realistic domains for learning possess regularities that make it possible to generalize experience a...
Reinforcement learning presents a challenging problem: agents must generalize experiences, efficient...
This paper introduces a novel approach for abstraction selection in reinforcement learning problems ...
The application of reinforcement learning (RL) algorithms is often hindered by the combinatorial exp...
In this paper we present a new method for reinforcement learning in relational domains. A logical la...
Reinforcement learning (RL) provides a general framework for data-driven decision making. However, t...
Deep reinforcement learning methods are capable of learning complex heuristics starting with no prio...