Opponent modeling is a critical mechanism in repeated games. It allows a player to adapt its strategy in order to better respond to the presumed preferences of his opponents. We introduce a new modeling technique that adaptively balances exploitability and risk reduction. An opponent’s strategy is modeled with a set of possible strategies that contain the actual strategy with a high probability. The algorithm is safe as the expected payoff is above the minimax payoff with a high probability, and can exploit the opponents’ preferences when sufficient observations have been obtained. We apply them to normal-form games and stochastic games with a finite number of stages. The performance of the proposed approach is first demonstrat...
This paper casts coordination of a team of robots within the framework of game theoretic learning al...
Distributed optimization can be formulated as an n player coordination game. One of the most common ...
. Agents that operate in a multi-agent system need an efficient strategy to handle their encounters ...
Opponent modeling is a critical mechanism in repeated games. It allows a player to adapt its strateg...
Opponent modeling is a critical mechanism in repeated games. It allows a player to adapt its strateg...
Research in opponent modelling has shown success, but a fundamental question has been overlooked: wh...
The main concern of this paper is the problem of opponent modeling. The goal of this work is to int...
University of Minnesota Ph.D. dissertation. August, 2008. Major: Computer Science. Advisor: Maria Gi...
When an opponent with a stationary and stochastic policy is encountered in a two-player competitive ...
Distributed optimization can be formulated as an n-player coordination game. One of the most common ...
This paper casts coordination of a team of robots within the framework of game theoretic learning al...
A key issue for a device involved in a competitive game is to be able to match the ability of the op...
Planning how to interact against bounded memory and unbounded memory learning opponents needs differ...
Human learning transfer takes advantage of important cognitive building blocks such as an abstract r...
This thesis investigates the use of Bayesian analysis upon an opponent's behaviour in order to deter...
This paper casts coordination of a team of robots within the framework of game theoretic learning al...
Distributed optimization can be formulated as an n player coordination game. One of the most common ...
. Agents that operate in a multi-agent system need an efficient strategy to handle their encounters ...
Opponent modeling is a critical mechanism in repeated games. It allows a player to adapt its strateg...
Opponent modeling is a critical mechanism in repeated games. It allows a player to adapt its strateg...
Research in opponent modelling has shown success, but a fundamental question has been overlooked: wh...
The main concern of this paper is the problem of opponent modeling. The goal of this work is to int...
University of Minnesota Ph.D. dissertation. August, 2008. Major: Computer Science. Advisor: Maria Gi...
When an opponent with a stationary and stochastic policy is encountered in a two-player competitive ...
Distributed optimization can be formulated as an n-player coordination game. One of the most common ...
This paper casts coordination of a team of robots within the framework of game theoretic learning al...
A key issue for a device involved in a competitive game is to be able to match the ability of the op...
Planning how to interact against bounded memory and unbounded memory learning opponents needs differ...
Human learning transfer takes advantage of important cognitive building blocks such as an abstract r...
This thesis investigates the use of Bayesian analysis upon an opponent's behaviour in order to deter...
This paper casts coordination of a team of robots within the framework of game theoretic learning al...
Distributed optimization can be formulated as an n player coordination game. One of the most common ...
. Agents that operate in a multi-agent system need an efficient strategy to handle their encounters ...