This paper shows that the distributed representation found in Learning Vector Quantization (LVQ) enables reinforcement learning methods to cope with a large decision search space, defined in terms of equivalence classes of input patterns like those found in the game of Go. In particular, this paper describes S[arsa]LVQ, a novel reinforcement learning algorithm and shows its feasibility for pattern-based inference in Go. As the distributed LVQ representation corresponds to a (quantized) codebook of compressed and generalized pattern templates, the state space requirements for online reinforcement methods are significantly reduced, thus decreasing the complexity of the decision space and consequently improving the play performance from patter...
Sequentially making-decision abounds in real-world problems ranging from robots needing to interact ...
The convergence property of reinforcement learning has been extensively investigated in the field of...
We introduce a novel type of actor-critic approach for deep reinforcement learning which is based on...
On-policy reinforcement learning provides online adaptation, a characteristic of intelligent systems...
Reinforcement learning has proven to be a set of successful techniques for nding optimal policies ...
Reinforcement learning is applied to computer-based playing of 5x5 Go. We have found that incorporat...
Proceeding of: RoboCup-99: Robot Soccer World Cup III, July 27 to August 6, 1999, Stockholm, SwedenR...
In this thesis we develop a unified framework for reinforcement learning and simulation-based search...
Reinforcement learning is a general and powerful way to formulate complex learning problems and acqu...
Autonomous automata should not only be able to learn how to behave efficiently in any predefined int...
This paper describes a novel hybrid reinforcement learning algorithm, Sarsa Learning Vector Quantiza...
General Game Playing agents are required to play games they have never seen before simply by looking...
In reinforcement learning (RL), an agent interacts with the environment by taking actions and observ...
Go is an ancient board game that poses unique opportunities and challenges for artificial intelligen...
The game of go is an ideal problem domain for exploring machine learning: it is easy to define and t...
Sequentially making-decision abounds in real-world problems ranging from robots needing to interact ...
The convergence property of reinforcement learning has been extensively investigated in the field of...
We introduce a novel type of actor-critic approach for deep reinforcement learning which is based on...
On-policy reinforcement learning provides online adaptation, a characteristic of intelligent systems...
Reinforcement learning has proven to be a set of successful techniques for nding optimal policies ...
Reinforcement learning is applied to computer-based playing of 5x5 Go. We have found that incorporat...
Proceeding of: RoboCup-99: Robot Soccer World Cup III, July 27 to August 6, 1999, Stockholm, SwedenR...
In this thesis we develop a unified framework for reinforcement learning and simulation-based search...
Reinforcement learning is a general and powerful way to formulate complex learning problems and acqu...
Autonomous automata should not only be able to learn how to behave efficiently in any predefined int...
This paper describes a novel hybrid reinforcement learning algorithm, Sarsa Learning Vector Quantiza...
General Game Playing agents are required to play games they have never seen before simply by looking...
In reinforcement learning (RL), an agent interacts with the environment by taking actions and observ...
Go is an ancient board game that poses unique opportunities and challenges for artificial intelligen...
The game of go is an ideal problem domain for exploring machine learning: it is easy to define and t...
Sequentially making-decision abounds in real-world problems ranging from robots needing to interact ...
The convergence property of reinforcement learning has been extensively investigated in the field of...
We introduce a novel type of actor-critic approach for deep reinforcement learning which is based on...