This paper introduces a new approach to actionvalue function approximation by learning basis functions from a spectral decomposition of the state-action manifold. This paper extends previous work on using Laplacian bases for value function approximation by using the actions of the agent as part of the representation when creating basis functions. The approach results in a nonlinear learned representation particularly suited to approximating action-value functions, without incurring the wasteful duplication of state bases in previous work. We discuss two techniques to create state-action graphs: offpolicy and on-policy. We show that these graphs have a greater expressive power and have better performance over state-based Laplacian basis func...
AbstractMarkov Decision Processes (MDPs) describe a wide variety of planning scenarios ranging from ...
This paper presents a new method for the autonomous construction of hierarchical action and state re...
We investigate the problem of automatically constructing efficient rep-resentations or basis functio...
This dissertation investigates the problem of representation discovery in discrete Markov decision p...
This dissertation investigates the problem of representation discovery in discrete Markov decision p...
Basis functions derived from an undirected graph connecting nearby samples from a Markov decision pr...
This paper addresses a fundamental issue central to approximation methods for solving large Markov d...
We address the problem of automatically constructing basis functions for linear approxim...
The ease or difficulty in solving a problem strongly depends on the way it is represented. For examp...
We investigate the problem of automatically constructing efficient representations or basis function...
The ease or difficulty in solving a problemstrongly depends on the way it is represented. For exampl...
Markov decision processes (MDPs) with discrete and continuous state and action components can be so...
Summarization: The majority of learning algorithms available today focus on approximating the state ...
Solving Markov decision processes (MDPs) efficiently is challenging in many cases, for example, when...
State abstraction and value function approximation are essential tools for the feasibility of sequen...
AbstractMarkov Decision Processes (MDPs) describe a wide variety of planning scenarios ranging from ...
This paper presents a new method for the autonomous construction of hierarchical action and state re...
We investigate the problem of automatically constructing efficient rep-resentations or basis functio...
This dissertation investigates the problem of representation discovery in discrete Markov decision p...
This dissertation investigates the problem of representation discovery in discrete Markov decision p...
Basis functions derived from an undirected graph connecting nearby samples from a Markov decision pr...
This paper addresses a fundamental issue central to approximation methods for solving large Markov d...
We address the problem of automatically constructing basis functions for linear approxim...
The ease or difficulty in solving a problem strongly depends on the way it is represented. For examp...
We investigate the problem of automatically constructing efficient representations or basis function...
The ease or difficulty in solving a problemstrongly depends on the way it is represented. For exampl...
Markov decision processes (MDPs) with discrete and continuous state and action components can be so...
Summarization: The majority of learning algorithms available today focus on approximating the state ...
Solving Markov decision processes (MDPs) efficiently is challenging in many cases, for example, when...
State abstraction and value function approximation are essential tools for the feasibility of sequen...
AbstractMarkov Decision Processes (MDPs) describe a wide variety of planning scenarios ranging from ...
This paper presents a new method for the autonomous construction of hierarchical action and state re...
We investigate the problem of automatically constructing efficient rep-resentations or basis functio...