International audienceThe Factored Markov Decision Process (fmdp) framework is a standard representation for sequential decision problems under uncertainty where the state is represented as a collection of random variables. Factored Reinforcement Learning (frl) is an Model-based Reinforcement Learning approach to fmdps where the transition and reward functions of the problem are learned. In this paper, we show how to model in a theoretically well-founded way the problems where some combinations of state variable values may not occur, giving rise to impossible states. Furthermore, we propose a new heuristics that considers as impossible the states that have not been seen so far. We derive an algorithm whose improvement in performance with re...
Increasing attention has been paid to reinforcement learning algorithms in recent years, partly due ...
Any reinforcement learning algorithm that applies to all Markov decision processes (MDPs) will suer ...
In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are thos...
Abstract. The Factored Markov Decision Process (FMDP) framework is a stan-dard representation for se...
This thesis is accomplished in the context of the industrial simulation domain that addresses the pr...
In this paper, we describe how techniques from reinforcement learning might be used to approach the ...
Sequential decision making from experience, or reinforcement learning (RL), is a paradigm that is we...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
The problem of selecting the right state-representation in a reinforcement learning problem is consi...
International audienceMarkovian systems are widely used in reinforcement learning (RL), when the suc...
This thesis is accomplished in the context of the industrial simulation domain that addresses the pr...
AbstractTechniques based on reinforcement learning (RL) have been used to build systems that learn t...
International audienceWe consider a reinforcement learning setting where the learner does not have e...
Reinforcement Learning (RL) is currently an active research area of Artificial Intelligence (AI) in ...
Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning contr...
Increasing attention has been paid to reinforcement learning algorithms in recent years, partly due ...
Any reinforcement learning algorithm that applies to all Markov decision processes (MDPs) will suer ...
In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are thos...
Abstract. The Factored Markov Decision Process (FMDP) framework is a stan-dard representation for se...
This thesis is accomplished in the context of the industrial simulation domain that addresses the pr...
In this paper, we describe how techniques from reinforcement learning might be used to approach the ...
Sequential decision making from experience, or reinforcement learning (RL), is a paradigm that is we...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
The problem of selecting the right state-representation in a reinforcement learning problem is consi...
International audienceMarkovian systems are widely used in reinforcement learning (RL), when the suc...
This thesis is accomplished in the context of the industrial simulation domain that addresses the pr...
AbstractTechniques based on reinforcement learning (RL) have been used to build systems that learn t...
International audienceWe consider a reinforcement learning setting where the learner does not have e...
Reinforcement Learning (RL) is currently an active research area of Artificial Intelligence (AI) in ...
Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning contr...
Increasing attention has been paid to reinforcement learning algorithms in recent years, partly due ...
Any reinforcement learning algorithm that applies to all Markov decision processes (MDPs) will suer ...
In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are thos...