This work exploits action equivariance for representation learning in reinforcement learning. Equivariance under actions states that transitions in the input space are mirrored by equivalent transitions in latent space, while the map and transition functions should also commute. We introduce a contrastive loss function that enforces action equivariance on the learned representations. We prove that when our loss is zero, we have a homomorphism of a deterministic Markov Decision Process (MDP). Learning equivariant maps leads to structured latent spaces, allowing us to build a model on which we plan through value iteration. We show experimentally that for deterministic MDPs, the optimal policy in the abstract MDP can be successfully lifted to ...
We introduce a model-free algorithm for learning in Markov decision processes with parameterized act...
We define a metric for measuring behavior similarity between states in a Markov decision process (MD...
A common strategy in modern learning systems is to learn a representation that is useful for many ta...
This work exploits action equivariance for representation learning in reinforcement learning. Equiva...
To operate effectively in complex environments learning agents require the ability to form useful ab...
This paper introduces MDP homomorphic networks for deep reinforcement learning. MDP homomorphic netw...
This dissertation investigates the problem of representation discovery in discrete Markov decision p...
International audienceThis paper deals with the Inverse Reinforcement Learning framework, whose purp...
We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowled...
In this paper we present a new method for reinforcement learning in relational domains. A logical la...
Animals are able to rapidly infer from limited experience when sets of state action pairs have equiv...
© 2020 Dmitry GrebenyukA Markov decision process (MDP) cannot be used for learning end-to-end contro...
Summarization: The majority of learning algorithms available today focus on approximating the state ...
International audienceWe study the role of the representation of state-action value functions in reg...
We study the problem of learning a policy in a Markov decision process (MDP) based on observations o...
We introduce a model-free algorithm for learning in Markov decision processes with parameterized act...
We define a metric for measuring behavior similarity between states in a Markov decision process (MD...
A common strategy in modern learning systems is to learn a representation that is useful for many ta...
This work exploits action equivariance for representation learning in reinforcement learning. Equiva...
To operate effectively in complex environments learning agents require the ability to form useful ab...
This paper introduces MDP homomorphic networks for deep reinforcement learning. MDP homomorphic netw...
This dissertation investigates the problem of representation discovery in discrete Markov decision p...
International audienceThis paper deals with the Inverse Reinforcement Learning framework, whose purp...
We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowled...
In this paper we present a new method for reinforcement learning in relational domains. A logical la...
Animals are able to rapidly infer from limited experience when sets of state action pairs have equiv...
© 2020 Dmitry GrebenyukA Markov decision process (MDP) cannot be used for learning end-to-end contro...
Summarization: The majority of learning algorithms available today focus on approximating the state ...
International audienceWe study the role of the representation of state-action value functions in reg...
We study the problem of learning a policy in a Markov decision process (MDP) based on observations o...
We introduce a model-free algorithm for learning in Markov decision processes with parameterized act...
We define a metric for measuring behavior similarity between states in a Markov decision process (MD...
A common strategy in modern learning systems is to learn a representation that is useful for many ta...