Animals are able to rapidly infer from limited experience when sets of state action pairs have equivalent reward and transition dynamics. On the other hand, modern reinforcement learning systems must painstakingly learn through trial and error that sets of state action pairs are value equivalent -- requiring an often prohibitively large amount of samples from their environment. MDP homomorphisms have been proposed that reduce the observed MDP of an environment to an abstract MDP, which can enable more sample efficient policy learning. Consequently, impressive improvements in sample efficiency have been achieved when a suitable MDP homomorphism can be constructed a priori -- usually by exploiting a practioner's knowledge of environment symme...
It is crucial for agents, both biological and artificial, to acquire world models that veridically r...
Many works in explainable AI have focused on explaining black-box classification models. Explaining ...
We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowled...
To operate effectively in complex environments learning agents require the ability to form useful ab...
Abstraction plays an important role in the generalisation of knowledge and skills and is key to samp...
The application of reinforcement learning (RL) algorithms is often hindered by the combinatorial exp...
Many methods for Model-based Reinforcement learning (MBRL) in Markov decision processes (MDPs) provi...
International audienceIncorporating prior knowledge in reinforcement learning algorithms is mainly a...
Leveraging an equivalence property on the set of states of state-action pairs in anMarkov Decision P...
When reinforcement learning is applied with sparse rewards, agents must spend a prohibitively long t...
International audienceLeveraging an equivalence property in the state-space of a Markov Decision Pro...
How to learn an effective reinforcement learning-based model for control tasks from high-level visua...
We study reinforcement learning (RL) in settings where observations are high-dimensional, but where ...
This work exploits action equivariance for representation learning in reinforcement learning. Equiva...
© 2020 Dmitry GrebenyukA Markov decision process (MDP) cannot be used for learning end-to-end contro...
It is crucial for agents, both biological and artificial, to acquire world models that veridically r...
Many works in explainable AI have focused on explaining black-box classification models. Explaining ...
We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowled...
To operate effectively in complex environments learning agents require the ability to form useful ab...
Abstraction plays an important role in the generalisation of knowledge and skills and is key to samp...
The application of reinforcement learning (RL) algorithms is often hindered by the combinatorial exp...
Many methods for Model-based Reinforcement learning (MBRL) in Markov decision processes (MDPs) provi...
International audienceIncorporating prior knowledge in reinforcement learning algorithms is mainly a...
Leveraging an equivalence property on the set of states of state-action pairs in anMarkov Decision P...
When reinforcement learning is applied with sparse rewards, agents must spend a prohibitively long t...
International audienceLeveraging an equivalence property in the state-space of a Markov Decision Pro...
How to learn an effective reinforcement learning-based model for control tasks from high-level visua...
We study reinforcement learning (RL) in settings where observations are high-dimensional, but where ...
This work exploits action equivariance for representation learning in reinforcement learning. Equiva...
© 2020 Dmitry GrebenyukA Markov decision process (MDP) cannot be used for learning end-to-end contro...
It is crucial for agents, both biological and artificial, to acquire world models that veridically r...
Many works in explainable AI have focused on explaining black-box classification models. Explaining ...
We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowled...