Research Doctorate - Doctor of Philosophy (PhD)Machine learning in multi-agent domains poses several research challenges. One challenge is how to model cooperation between reinforcement learners. Cooperation between independent reinforcement learners is known to accelerate convergence to optimal solutions. In large state space problems, independent reinforcement learners normally cooperate to accelerate the learning process using decomposition techniques or knowledge sharing strategies. This thesis presents two techniques to multi-agent reinforcement learning and a comparison study. The first technique is a formal decomposition model and an algorithm for distributed systems. The second technique is a cooperative Q-learning algorithm for mul...
We report on an investigation of reinforcement learning techniques for the learning of coordination ...
peer reviewedIn this article we describe a set of scalable techniques for learning the behavior of a...
In this thesis, we first suggest a new type of Markov model extended by Watkins’ action replay proce...
The hierarchical organisation of distributed systems can provide an efficient decomposition for mach...
Cooperative reinforcement learning algorithms such as BEST-Q, AVE-Q, PSO-Q, and WSS use Q-value shar...
International audienceMulti-agent systems (MAS) are a field of study of growing interest in a variet...
Abstract—Multi-agent systems (MAS) are a field of study of growing interest in a variety of domains ...
Cooperative multi-agent systems (MAS) are finding applications in a wide variety of domains, includi...
We report on an investigation of reinforcement learning tech-niques for the learning of coordination...
Mutual learning is an emerging field in intelligent systems which takes inspiration from naturally i...
We present a conceptual framework for creating Qlearning-based algorithms that converge to optimal e...
On-line learning methods have been applied successfully in multi-agent systems to achieve coordinati...
Multi-Agent Reinforcement Learning (MARL) algorithms face two main difficulties: the curse of dimens...
© Springer Nature Switzerland AG 2018. This paper presents a proof-of concept study for demonstratin...
Abstract- This paper presents a cooperative reinforcement learning algorithm of multi-agent systems....
We report on an investigation of reinforcement learning techniques for the learning of coordination ...
peer reviewedIn this article we describe a set of scalable techniques for learning the behavior of a...
In this thesis, we first suggest a new type of Markov model extended by Watkins’ action replay proce...
The hierarchical organisation of distributed systems can provide an efficient decomposition for mach...
Cooperative reinforcement learning algorithms such as BEST-Q, AVE-Q, PSO-Q, and WSS use Q-value shar...
International audienceMulti-agent systems (MAS) are a field of study of growing interest in a variet...
Abstract—Multi-agent systems (MAS) are a field of study of growing interest in a variety of domains ...
Cooperative multi-agent systems (MAS) are finding applications in a wide variety of domains, includi...
We report on an investigation of reinforcement learning tech-niques for the learning of coordination...
Mutual learning is an emerging field in intelligent systems which takes inspiration from naturally i...
We present a conceptual framework for creating Qlearning-based algorithms that converge to optimal e...
On-line learning methods have been applied successfully in multi-agent systems to achieve coordinati...
Multi-Agent Reinforcement Learning (MARL) algorithms face two main difficulties: the curse of dimens...
© Springer Nature Switzerland AG 2018. This paper presents a proof-of concept study for demonstratin...
Abstract- This paper presents a cooperative reinforcement learning algorithm of multi-agent systems....
We report on an investigation of reinforcement learning techniques for the learning of coordination ...
peer reviewedIn this article we describe a set of scalable techniques for learning the behavior of a...
In this thesis, we first suggest a new type of Markov model extended by Watkins’ action replay proce...