Most research in reinforcement learning has focused on stationary environments. In this paper, we propose several adaptations of Q-learning for a dynamic environment, for both single and multiple agents. The environment consists of a grid of random rewards, where every reward is removed after a visit. We focus on experience replay, a technique that receives a lot of attention nowadays, and combine this method with Q-learning. We compare two variations of experience replay, where experiences are reused based on time or based on the obtained reward. For multi-agent reinforcement learning we compare two variations of policy representation. In the first variation the agents share a Q-function, while in the second variation both agents have a se...
Typically, a reinforcement learning agent interacts with the environment and learns how to select an...
The application of decision making and learning algorithms to multi-agent systems presents many inte...
The Reinforcement learning (RL) algorithms solve a wide range of problems we faced. The topic of RL ...
Most research in reinforcement learning has focused on stationary environments. In this paper, we pr...
On-line learning methods have been applied successfully in multi-agent systems to achieve coordinati...
Experience replay memory in reinforcement learning enables agents to remember and reuse past experie...
Many real-world problems, such as network packet routing and urban traffic control, are naturally mo...
Using neural networks as function approximators in temporal difference reinforcement problems proved...
Reinforcement learning is the problem faced by an agent that must learn behaviour through trial-and-...
Research Doctorate - Doctor of Philosophy (PhD)Machine learning in multi-agent domains poses several...
Cooperative reinforcement learning algorithms such as BEST-Q, AVE-Q, PSO-Q, and WSS use Q-value shar...
Although well understood in the single-agent framework, the use of traditional reinforcement learnin...
This paper investigates the use of experience generaliza-tion on concurrent and on-line policy learn...
Experience replay is one of the most commonly used approaches to improve the sample efficiency of re...
In this thesis, we first suggest a new type of Markov model extended by Watkins’ action replay proce...
Typically, a reinforcement learning agent interacts with the environment and learns how to select an...
The application of decision making and learning algorithms to multi-agent systems presents many inte...
The Reinforcement learning (RL) algorithms solve a wide range of problems we faced. The topic of RL ...
Most research in reinforcement learning has focused on stationary environments. In this paper, we pr...
On-line learning methods have been applied successfully in multi-agent systems to achieve coordinati...
Experience replay memory in reinforcement learning enables agents to remember and reuse past experie...
Many real-world problems, such as network packet routing and urban traffic control, are naturally mo...
Using neural networks as function approximators in temporal difference reinforcement problems proved...
Reinforcement learning is the problem faced by an agent that must learn behaviour through trial-and-...
Research Doctorate - Doctor of Philosophy (PhD)Machine learning in multi-agent domains poses several...
Cooperative reinforcement learning algorithms such as BEST-Q, AVE-Q, PSO-Q, and WSS use Q-value shar...
Although well understood in the single-agent framework, the use of traditional reinforcement learnin...
This paper investigates the use of experience generaliza-tion on concurrent and on-line policy learn...
Experience replay is one of the most commonly used approaches to improve the sample efficiency of re...
In this thesis, we first suggest a new type of Markov model extended by Watkins’ action replay proce...
Typically, a reinforcement learning agent interacts with the environment and learns how to select an...
The application of decision making and learning algorithms to multi-agent systems presents many inte...
The Reinforcement learning (RL) algorithms solve a wide range of problems we faced. The topic of RL ...