Distributed optimization can be formulated as an n-player coordination game. One of the most common learning techniques in game theory is fictitious play and its variations. However, fictitious play is founded on an implicit assumption that opponents’ strategies are stationary. In this paper we present a new variation of fictitious play in which players predict opponents’ strategy using a particle filter algorithm. This allows us to use a more realistic model of opponent strategy. We used pre-specified opponents’ strategies to examine if our algorithm can efficiently track the strategies. Furthermore, we have used these experiments to examine the impact of different values of our algorithm parameters on the results of strategy tracking. We ...
<p>This dissertation studies multi-agent algorithms for learning Nash equilibrium strategies in game...
Planning how to interact against bounded memory and unbounded memory learning opponents needs differ...
Planning how to interact against bounded memory and unbounded memory learning opponents needs differ...
Distributed optimization can be formulated as an n player coordination game. One of the most common ...
Distributed optimization can be formulated as an n player coordination game. One of the most common ...
Abstract. It is now well known that decentralised optimisation can be formulated as a potential game...
It is now well known that decentralised optimisation can be formulated as a potential game, and game...
This paper casts coordination of a team of robots within the framework of game theoretic learning al...
<p>The paper is concerned with distributed learning in large-scale games. The well-known fictitious ...
This paper casts coordination of a team of robots within the framework of game theoretic learning al...
Potential games and decentralised partially observable MDPs (Dec–POMDPs) are two commonly used model...
This paper casts coordination of a team of robots within the framework of game theoretic learning al...
This paper casts coordination of a team of robots within the framework of game theoretic learning al...
This dissertation studies multi-agent algorithms for learning Nash equilibrium strategies in games w...
Potential games and decentralised partially observable MDPs (Dec-POMDPs) are two commonly used model...
<p>This dissertation studies multi-agent algorithms for learning Nash equilibrium strategies in game...
Planning how to interact against bounded memory and unbounded memory learning opponents needs differ...
Planning how to interact against bounded memory and unbounded memory learning opponents needs differ...
Distributed optimization can be formulated as an n player coordination game. One of the most common ...
Distributed optimization can be formulated as an n player coordination game. One of the most common ...
Abstract. It is now well known that decentralised optimisation can be formulated as a potential game...
It is now well known that decentralised optimisation can be formulated as a potential game, and game...
This paper casts coordination of a team of robots within the framework of game theoretic learning al...
<p>The paper is concerned with distributed learning in large-scale games. The well-known fictitious ...
This paper casts coordination of a team of robots within the framework of game theoretic learning al...
Potential games and decentralised partially observable MDPs (Dec–POMDPs) are two commonly used model...
This paper casts coordination of a team of robots within the framework of game theoretic learning al...
This paper casts coordination of a team of robots within the framework of game theoretic learning al...
This dissertation studies multi-agent algorithms for learning Nash equilibrium strategies in games w...
Potential games and decentralised partially observable MDPs (Dec-POMDPs) are two commonly used model...
<p>This dissertation studies multi-agent algorithms for learning Nash equilibrium strategies in game...
Planning how to interact against bounded memory and unbounded memory learning opponents needs differ...
Planning how to interact against bounded memory and unbounded memory learning opponents needs differ...