Reinforcement Learning (RL) in either fully or partially observable domains usually poses a requirement on the knowledge representation in order to be sound: the underlying stochastic process must be Markovian. In many applications, including those involving interactions between multiple agents (e.g., humans and robots), sources of uncertainty affect rewards and transition dynamics in such a way that a Markovian representation would be computationally very expensive. An alternative formulation of the decision problem involves partially specified behaviors with choice points. While this reduces the complexity of the policy space that must be explored - something that is crucial for realistic autonomous agents that must bound search time - it...
A large class of problems of sequential decision making under uncertainty, of which the underlying p...
Thesis (Ph.D.)--University of Washington, 2020Informed and robust decision making in the face of unc...
The first part of a two-part series of papers provides a survey on recent advances in Deep Reinforce...
Increasing attention has been paid to reinforcement learning algorithms in recent years, partly due ...
Sequentially making-decision abounds in real-world problems ranging from robots needing to interact ...
The Markov decision process (MDP) formulation used to model many real-world sequential decision maki...
Policy search algorithms have facilitated application of Reinforcement Learning (RL) to dynamic syst...
The standard RL world model is that of a Markov Decision Process (MDP). A basic premise of MDPs is t...
Reinforcement Learning (RL) is currently an active research area of Artificial Intelligence (AI) in ...
Learning to play in the presence of independent and self-motivated opponents is a difficult task, be...
Reinforcement learning methods are being applied to control problems in robotics domain. These algor...
We consider un-discounted reinforcement learning (RL) in Markov decision processes (MDPs) under dri...
<div><p>Humans can learn under a wide variety of feedback conditions. Reinforcement learning (RL), w...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
To quickly achieve good performance, reinforcement-learning algorithms for acting in large continuou...
A large class of problems of sequential decision making under uncertainty, of which the underlying p...
Thesis (Ph.D.)--University of Washington, 2020Informed and robust decision making in the face of unc...
The first part of a two-part series of papers provides a survey on recent advances in Deep Reinforce...
Increasing attention has been paid to reinforcement learning algorithms in recent years, partly due ...
Sequentially making-decision abounds in real-world problems ranging from robots needing to interact ...
The Markov decision process (MDP) formulation used to model many real-world sequential decision maki...
Policy search algorithms have facilitated application of Reinforcement Learning (RL) to dynamic syst...
The standard RL world model is that of a Markov Decision Process (MDP). A basic premise of MDPs is t...
Reinforcement Learning (RL) is currently an active research area of Artificial Intelligence (AI) in ...
Learning to play in the presence of independent and self-motivated opponents is a difficult task, be...
Reinforcement learning methods are being applied to control problems in robotics domain. These algor...
We consider un-discounted reinforcement learning (RL) in Markov decision processes (MDPs) under dri...
<div><p>Humans can learn under a wide variety of feedback conditions. Reinforcement learning (RL), w...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
To quickly achieve good performance, reinforcement-learning algorithms for acting in large continuou...
A large class of problems of sequential decision making under uncertainty, of which the underlying p...
Thesis (Ph.D.)--University of Washington, 2020Informed and robust decision making in the face of unc...
The first part of a two-part series of papers provides a survey on recent advances in Deep Reinforce...