In reinforcement learning it is frequently necessary to resort to an approximation to the true optimal value function. Here we investigate the bene ts of online search in such cases. We examine \local " searches, where the agent performs a nite-depth lookahead search, and \global " searches, where the agent performs a search for a trajectory all the way from the current state to a goal state. The key to the success of these methods lies in taking a value function, which gives a rough solution to the hard problem of nding good trajectories from every single state, and combining that with online search, which then gives an accurate solution to the easier problem of nding a good trajectory speci cally from the current state.
Reinforcement learning requires exploration, leading to repeated execution of sub-optimal actions. N...
We investigate the role of learning in search-based systems for solving optimization problems. We us...
In search problems, a mobile searcher seeks to locate a target that hides in some unknown position o...
In this paper, we describe methods for efficiently com-puting better solutions to control problems i...
Reinforcement learning methods can be used to improve the performance of local search algorithms for...
In this paper a reinforcement learning methodology for automatic online algorithm selection is intro...
This paper presents a framework allowing to tune continual exploration in an optimal way. It first q...
We describe a reinforcement learning-based variation to the combinatorial optimization technique kno...
We consider the problem of finding the best features for value function approximation in reinforceme...
This paper formalizes the problem of choosing online the number of explorations in a local search al...
Effective solving of constraint problems often requires choosing good or specific search heuristics....
On-policy reinforcement learning provides online adaptation, a characteristic of intelligent systems...
This paper presents two general approaches that address the problems of the local character of the s...
How e#ciently can we search an unknown environment for a goal in unknown position? How much would i...
Abstract. This paper presents a framework allowing to tune continual explo-ration in an optimal way....
Reinforcement learning requires exploration, leading to repeated execution of sub-optimal actions. N...
We investigate the role of learning in search-based systems for solving optimization problems. We us...
In search problems, a mobile searcher seeks to locate a target that hides in some unknown position o...
In this paper, we describe methods for efficiently com-puting better solutions to control problems i...
Reinforcement learning methods can be used to improve the performance of local search algorithms for...
In this paper a reinforcement learning methodology for automatic online algorithm selection is intro...
This paper presents a framework allowing to tune continual exploration in an optimal way. It first q...
We describe a reinforcement learning-based variation to the combinatorial optimization technique kno...
We consider the problem of finding the best features for value function approximation in reinforceme...
This paper formalizes the problem of choosing online the number of explorations in a local search al...
Effective solving of constraint problems often requires choosing good or specific search heuristics....
On-policy reinforcement learning provides online adaptation, a characteristic of intelligent systems...
This paper presents two general approaches that address the problems of the local character of the s...
How e#ciently can we search an unknown environment for a goal in unknown position? How much would i...
Abstract. This paper presents a framework allowing to tune continual explo-ration in an optimal way....
Reinforcement learning requires exploration, leading to repeated execution of sub-optimal actions. N...
We investigate the role of learning in search-based systems for solving optimization problems. We us...
In search problems, a mobile searcher seeks to locate a target that hides in some unknown position o...