© 2020 Dmitry GrebenyukA Markov decision process (MDP) cannot be used for learning end-to-end control policies in Reinforcement Learning when the dimension of the feature vectors changes from one trial to the next. For example, this difference is present in an environment where the number of blocks to manipulate can vary. Because we cannot learn a different policy for each number of blocks, we suggest framing the problem as a POMDP instead of the MDP. It allows us to construct a constant observation space for a dynamic state space. There are two ways we can achieve such construction. First, we can design a hand-crafted set of observations for a particular problem. However, that set cannot be readily transferred to another problem, and it o...
This paper provides new techniques for abstracting the state space of a Markov Decision Process (MD...
The application of reinforcement learning (RL) algorithms is often hindered by the combinatorial exp...
In the sequential decision making setting, an agent aims to achieve systematic generalization over a...
A reinforcement-learning agent learns by trying actions and observing resulting reward in each state...
Reinforcement learning (RL) has seen widespread success in creating intelligent agents in several ch...
General purpose intelligent learning agents cycle through (complex,non-MDP) sequences of observatio...
General purpose intelligent learning agents cycle through (complex,non-MDP) sequences of observation...
International audienceThis paper deals with the Inverse Reinforcement Learning framework, whose purp...
In recent decades, Reinforcement Learning (RL) has emerged as an effective approach to address compl...
Summarization: Good policies in reinforcement learning problems typically exhibit significant struct...
In this paper, we give a brief review of Markov Decision Processes (MDPs), and how Reinforcement Lea...
Feature selection is an important challenge in machine learning. Unfortunately, most methods for aut...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
In the sequential decision making setting, an agent aims to achieve systematic generalization over a...
We consider the problem of learning a policy for a Markov decision process consistent with data capt...
This paper provides new techniques for abstracting the state space of a Markov Decision Process (MD...
The application of reinforcement learning (RL) algorithms is often hindered by the combinatorial exp...
In the sequential decision making setting, an agent aims to achieve systematic generalization over a...
A reinforcement-learning agent learns by trying actions and observing resulting reward in each state...
Reinforcement learning (RL) has seen widespread success in creating intelligent agents in several ch...
General purpose intelligent learning agents cycle through (complex,non-MDP) sequences of observatio...
General purpose intelligent learning agents cycle through (complex,non-MDP) sequences of observation...
International audienceThis paper deals with the Inverse Reinforcement Learning framework, whose purp...
In recent decades, Reinforcement Learning (RL) has emerged as an effective approach to address compl...
Summarization: Good policies in reinforcement learning problems typically exhibit significant struct...
In this paper, we give a brief review of Markov Decision Processes (MDPs), and how Reinforcement Lea...
Feature selection is an important challenge in machine learning. Unfortunately, most methods for aut...
The field of Reinforcement Learning is concerned with teaching agents to take optimal decisions t...
In the sequential decision making setting, an agent aims to achieve systematic generalization over a...
We consider the problem of learning a policy for a Markov decision process consistent with data capt...
This paper provides new techniques for abstracting the state space of a Markov Decision Process (MD...
The application of reinforcement learning (RL) algorithms is often hindered by the combinatorial exp...
In the sequential decision making setting, an agent aims to achieve systematic generalization over a...