This dissertation investigates the problem of representation discovery in discrete Markov decision processes, namely how agents can simultaneously learn representation and optimal control. Previous work on function approximation techniques for MDPs largely employed hand-engineered basis functions. In this dissertation, we explore approaches to automatically construct these basis functions and demonstrate that automatically constructed basis functions significantly outperform more traditional, hand-engineered approaches. We specifically examine two problems: how to automatically build representations for action-value functions by explicitly incorporating actions into a representation, and how representations can be automatically constructed ...
This paper addresses a fundamental issue central to approximation methods for solving large Markov d...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, us...
This dissertation investigates the problem of representation discovery in discrete Markov decision p...
This paper introduces a new approach to actionvalue function approximation by learning basis functio...
The ease or difficulty in solving a problem strongly depends on the way it is represented. For examp...
Solving Markov decision processes (MDPs) efficiently is challenging in many cases, for example, when...
We address the problem of automatically constructing basis functions for linear approxim...
Automatically constructing novel representations of tasks from analysis of state spaces is a longsta...
The ease or difficulty in solving a problemstrongly depends on the way it is represented. For exampl...
State abstraction and value function approximation are essential tools for the feasibility of sequen...
To operate effectively in complex environments learning agents require the ability to form useful ab...
Markov decision processes (MDPs) with discrete and continuous state and action components can be so...
Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, us...
The application of reinforcement learning (RL) algorithms is often hindered by the combinatorial exp...
This paper addresses a fundamental issue central to approximation methods for solving large Markov d...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, us...
This dissertation investigates the problem of representation discovery in discrete Markov decision p...
This paper introduces a new approach to actionvalue function approximation by learning basis functio...
The ease or difficulty in solving a problem strongly depends on the way it is represented. For examp...
Solving Markov decision processes (MDPs) efficiently is challenging in many cases, for example, when...
We address the problem of automatically constructing basis functions for linear approxim...
Automatically constructing novel representations of tasks from analysis of state spaces is a longsta...
The ease or difficulty in solving a problemstrongly depends on the way it is represented. For exampl...
State abstraction and value function approximation are essential tools for the feasibility of sequen...
To operate effectively in complex environments learning agents require the ability to form useful ab...
Markov decision processes (MDPs) with discrete and continuous state and action components can be so...
Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, us...
The application of reinforcement learning (RL) algorithms is often hindered by the combinatorial exp...
This paper addresses a fundamental issue central to approximation methods for solving large Markov d...
Abstract The problem of reinforcement learning in a non-Markov environment isexplored using a dynami...
Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, us...