In this paper, we describe how techniques from reinforcement learning might be used to approach the problem of acting under uncertainty. We start by introducing the theory of partially observable Markov decision processes (POMDPs) to describe what we call hidden state problems. After a brief review of other POMDP solution techniques, we motivate reinforcement learning by considering an agent with no previous knowledge of the environment model. We describe two major groups of reinforcement learning techniques: those the learn a value function over states of world, and those that search in the space of policies directly. Finally, we discuss the general problems with these methods, and suggest promising avenues for future research.
Sequentially making-decision abounds in real-world problems ranging from robots needing to interact ...
Abstract. The Factored Markov Decision Process (FMDP) framework is a stan-dard representation for se...
This thesis is accomplished in the context of the industrial simulation domain that addresses the pr...
Partially observable Markov decision processes (POMDPs) are interesting because they provide a gener...
People are efficient when they make decisions under uncertainty, even when their decisions have long...
In applying reinforcement learning to agents acting in the real world we are often faced with tasks ...
Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning contr...
A very general framework for modeling uncertainty in learning environments is given by Partially Obs...
AbstractTechniques based on reinforcement learning (RL) have been used to build systems that learn t...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
AbstractActing in domains where an agent must plan several steps ahead to achieve a goal can be a ch...
The problem of making optimal decisions in uncertain conditions is central to Artificial Intelligenc...
Increasing attention has been paid to reinforcement learning algorithms in recent years, partly due ...
Bayesian Reinforcement Learning has generated substantial interest recently, as it provides an elega...
Reinforcement learning (RL) is a well-known class of machine learning algorithms used in planning an...
Sequentially making-decision abounds in real-world problems ranging from robots needing to interact ...
Abstract. The Factored Markov Decision Process (FMDP) framework is a stan-dard representation for se...
This thesis is accomplished in the context of the industrial simulation domain that addresses the pr...
Partially observable Markov decision processes (POMDPs) are interesting because they provide a gener...
People are efficient when they make decisions under uncertainty, even when their decisions have long...
In applying reinforcement learning to agents acting in the real world we are often faced with tasks ...
Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning contr...
A very general framework for modeling uncertainty in learning environments is given by Partially Obs...
AbstractTechniques based on reinforcement learning (RL) have been used to build systems that learn t...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
AbstractActing in domains where an agent must plan several steps ahead to achieve a goal can be a ch...
The problem of making optimal decisions in uncertain conditions is central to Artificial Intelligenc...
Increasing attention has been paid to reinforcement learning algorithms in recent years, partly due ...
Bayesian Reinforcement Learning has generated substantial interest recently, as it provides an elega...
Reinforcement learning (RL) is a well-known class of machine learning algorithms used in planning an...
Sequentially making-decision abounds in real-world problems ranging from robots needing to interact ...
Abstract. The Factored Markov Decision Process (FMDP) framework is a stan-dard representation for se...
This thesis is accomplished in the context of the industrial simulation domain that addresses the pr...