We present a novel and uniform formulation of the problem of reinforcement learning against bounded memory adaptive adversaries in repeated games, and the methodologies to accomplish learning in this novel framework. First we delineate a novel strategic definition of best response that optimises rewards over multiple steps, as opposed to the notion of tactical best response in game theory. We show that the problem of learning a strategic best response reduces to that of learning an optimal policy in a Markov Decision Process (MDP). We deal with both finite and infinite horizon versions of this problem. We adapt an existing Monte Carlo based algorithm for learning optimal policies in such MDPs over finite horizon, in polynomial time. We show...
In this paper, we address the problem of convergence to Nash equilibria in games with rewards that a...
Learning to play in the presence of independent and self-motivated opponents is a difficult task, be...
This dissertation studies multi-agent algorithms for learning Nash equilibrium strategies in games w...
We present a novel and uniform formulation of the problem of reinforcement learning against bounded ...
We provide a uniform framework for learning against a recent history adversary in arbitrary repeated...
This paper investigates a relatively new direction in Mul-tiagent Reinforcement Learning. Most multi...
We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-...
Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents ’ ...
This article investigates the performance of independent reinforcement learners in multi-agent games...
Planning how to interact against bounded memory and unbounded memory learning opponents needs differ...
Although multi-agent reinforcement learning can tackle systems of strategically interacting entities...
In this paper, we address multi-agent decision problems where all agents share a common goal. This c...
We present a new algorithm for polynomial time learning of optimal behavior in stochastic games. Thi...
Learning to converge to an efficient, i.e., Pareto-optimal Nash equilibrium of the repeated game is ...
The goal of a self-interested agent within a multi-agent system is to maximize its utility over time...
In this paper, we address the problem of convergence to Nash equilibria in games with rewards that a...
Learning to play in the presence of independent and self-motivated opponents is a difficult task, be...
This dissertation studies multi-agent algorithms for learning Nash equilibrium strategies in games w...
We present a novel and uniform formulation of the problem of reinforcement learning against bounded ...
We provide a uniform framework for learning against a recent history adversary in arbitrary repeated...
This paper investigates a relatively new direction in Mul-tiagent Reinforcement Learning. Most multi...
We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-...
Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents ’ ...
This article investigates the performance of independent reinforcement learners in multi-agent games...
Planning how to interact against bounded memory and unbounded memory learning opponents needs differ...
Although multi-agent reinforcement learning can tackle systems of strategically interacting entities...
In this paper, we address multi-agent decision problems where all agents share a common goal. This c...
We present a new algorithm for polynomial time learning of optimal behavior in stochastic games. Thi...
Learning to converge to an efficient, i.e., Pareto-optimal Nash equilibrium of the repeated game is ...
The goal of a self-interested agent within a multi-agent system is to maximize its utility over time...
In this paper, we address the problem of convergence to Nash equilibria in games with rewards that a...
Learning to play in the presence of independent and self-motivated opponents is a difficult task, be...
This dissertation studies multi-agent algorithms for learning Nash equilibrium strategies in games w...