Solving strategic games with huge action spaces is a critical yet under-explored topic in economics, operations research and artificial intelligence. This paper proposes new learning algorithms for solving two-player zero-sum normal-form games where the number of pure strategies is prohibitively large. Specifically, we combine no-regret analysis from online learning with Double Oracle (DO) from game theory. Our method---\emph{Online Double Oracle (ODO)}---is provably convergent to a Nash equilibrium (NE). Most importantly, unlike normal DO, ODO is \emph{rational} in the sense that each agent in ODO can exploit a strategic adversary with a regret bound of $\mathcal{O}(\sqrt{ k \log(k)/T})$, where $k$ is not the total number of pure str...
This dissertation studies multi-agent algorithms for learning Nash equilibrium strategies in games w...
One of the most popular methods for learning Nash equilibrium (NE) in large-scale imperfect informat...
In this paper, we study the learning problem in two-player general-sum Markov Games. We consider the...
Our work considers repeated games in which one player has a different objective than others. In part...
Developing scalable solution algorithms is one of the central problems in computational game theory....
With the recent advances in solving large, zero-sum extensive form games, there is a growing interes...
We propose a novel online learning method for minimizing regret in large extensive-form games. The a...
We study a novel setting in Online Markov Decision Processes (OMDPs) where the loss function is chos...
Many search and security games played on a graph can be modeled as normal-form zero-sum games with s...
Policy Space Response Oracle methods (PSRO) provide a general solution to learn Nash equilibrium in ...
Koller, Megiddo and von Stengel showed how to efficiently compute minimax strategies for two-player ...
International audience2-Player games in general provide a popular platform for research in Artificia...
Predicting strategic goal-oriented multi-agent behavior from observations of play is a ubiquitous ta...
This paper proposes novel, end-to-end deep reinforcement learning algorithms for learning two-player...
We consider the fundamental problem of prediction with expert advice where the experts are "optimiza...
This dissertation studies multi-agent algorithms for learning Nash equilibrium strategies in games w...
One of the most popular methods for learning Nash equilibrium (NE) in large-scale imperfect informat...
In this paper, we study the learning problem in two-player general-sum Markov Games. We consider the...
Our work considers repeated games in which one player has a different objective than others. In part...
Developing scalable solution algorithms is one of the central problems in computational game theory....
With the recent advances in solving large, zero-sum extensive form games, there is a growing interes...
We propose a novel online learning method for minimizing regret in large extensive-form games. The a...
We study a novel setting in Online Markov Decision Processes (OMDPs) where the loss function is chos...
Many search and security games played on a graph can be modeled as normal-form zero-sum games with s...
Policy Space Response Oracle methods (PSRO) provide a general solution to learn Nash equilibrium in ...
Koller, Megiddo and von Stengel showed how to efficiently compute minimax strategies for two-player ...
International audience2-Player games in general provide a popular platform for research in Artificia...
Predicting strategic goal-oriented multi-agent behavior from observations of play is a ubiquitous ta...
This paper proposes novel, end-to-end deep reinforcement learning algorithms for learning two-player...
We consider the fundamental problem of prediction with expert advice where the experts are "optimiza...
This dissertation studies multi-agent algorithms for learning Nash equilibrium strategies in games w...
One of the most popular methods for learning Nash equilibrium (NE) in large-scale imperfect informat...
In this paper, we study the learning problem in two-player general-sum Markov Games. We consider the...