Abstract. The Factored Markov Decision Process (FMDP) framework is a stan-dard representation for sequential decision problems under uncertainty where the state is represented as a collection of random variables. Factored Reinforcement Learning (FRL) is an Model-based Reinforcement Learning approach to FMDPs where the transition and reward functions of the problem are learned. In this pa-per, we show how to model in a theoretically well-founded way the problems where some combinations of state variable values may not occur, giving rise to impossible states. Furthermore, we propose a new heuristics that considers as im-possible the states that have not been seen so far. We derive an algorithm whose improvement in performance with respect to ...
Increasing attention has been paid to reinforcement learning algorithms in recent years, partly due ...
Any reinforcement learning algorithm that applies to all Markov decision processes (MDPs) will suer ...
In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are thos...
International audienceThe Factored Markov Decision Process (fmdp) framework is a standard representa...
This thesis is accomplished in the context of the industrial simulation domain that addresses the pr...
In this paper, we describe how techniques from reinforcement learning might be used to approach the ...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
The problem of selecting the right state-representation in a reinforcement learning problem is consi...
Sequential decision making from experience, or reinforcement learning (RL), is a paradigm that is we...
AbstractTechniques based on reinforcement learning (RL) have been used to build systems that learn t...
This thesis is accomplished in the context of the industrial simulation domain that addresses the pr...
International audienceMarkovian systems are widely used in reinforcement learning (RL), when the suc...
Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning contr...
Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, us...
Reinforcement Learning (RL) is currently an active research area of Artificial Intelligence (AI) in ...
Increasing attention has been paid to reinforcement learning algorithms in recent years, partly due ...
Any reinforcement learning algorithm that applies to all Markov decision processes (MDPs) will suer ...
In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are thos...
International audienceThe Factored Markov Decision Process (fmdp) framework is a standard representa...
This thesis is accomplished in the context of the industrial simulation domain that addresses the pr...
In this paper, we describe how techniques from reinforcement learning might be used to approach the ...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
The problem of selecting the right state-representation in a reinforcement learning problem is consi...
Sequential decision making from experience, or reinforcement learning (RL), is a paradigm that is we...
AbstractTechniques based on reinforcement learning (RL) have been used to build systems that learn t...
This thesis is accomplished in the context of the industrial simulation domain that addresses the pr...
International audienceMarkovian systems are widely used in reinforcement learning (RL), when the suc...
Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning contr...
Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, us...
Reinforcement Learning (RL) is currently an active research area of Artificial Intelligence (AI) in ...
Increasing attention has been paid to reinforcement learning algorithms in recent years, partly due ...
Any reinforcement learning algorithm that applies to all Markov decision processes (MDPs) will suer ...
In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are thos...