This thesis presents a modified Q-learning algorithm and provides conditions for convergence to a pure Nash equilibrium in potential games. In general Q-learning schemes, convergence to a Nash equilibrium may require decreasing step-sizes and long learning time. In this thesis, we consider a modified Q-learning algorithm based on constant step-sizes, inspired by Joint Strategy Fictitious Play (JSFP). When compared to JSFP, the Q-learning with constant step-sizes requires less information aggregation, but only reaches an approximation of a Nash equilibrium. We show that by appropriately choosing frequency dependent step-sizes, sufficient exploration of all actions is ensured and the estimated equilibrium approaches a Nash equilibrium.M.A.S
Hirsch [2], is called smooth fictitious play. Using techniques from stochastic approximation by the ...
The single-agent multi-armed bandit problem can be solved by an agent that learns the values of each...
Abstract The Nash equilibrium concept has previously been shown to be an important tool to understan...
This thesis presents a modified Q-learning algorithm and provides conditions for convergence to a pu...
We extend Q-learning to a noncooperative multiagent context, using the framework of general-sum stoc...
This thesis involves the use of a reinforcement learning algorithm (RL) called Q-learning to train a...
Recently, there have been several attempts to design multiagent Q-learning algorithms that learn equ...
In this paper, we address the problem of convergence to Nash equilibria in games with rewards that a...
Some game theory approaches to solve multiagent reinforce-ment learning in self play, i.e. when agen...
This paper introduces Correlated-Q (CE-Q) learning, a multiagent Q-learning algorithm based on the c...
This paper introduces Correlated-Q (CE-Q) learning, a multiagent Q-learning algorithm based on the c...
This paper describes an approach to rein-forcement learning in multiagent general-sum games in which...
Abstract—Learning processes that converge to mixed-strategy equilibria often exhibit learning only i...
This article investigates the performance of independent reinforcement learners in multi-agent games...
The paper studies the highly prototypical Fictitious Play (FP) algorithm, as well as a broad class o...
Hirsch [2], is called smooth fictitious play. Using techniques from stochastic approximation by the ...
The single-agent multi-armed bandit problem can be solved by an agent that learns the values of each...
Abstract The Nash equilibrium concept has previously been shown to be an important tool to understan...
This thesis presents a modified Q-learning algorithm and provides conditions for convergence to a pu...
We extend Q-learning to a noncooperative multiagent context, using the framework of general-sum stoc...
This thesis involves the use of a reinforcement learning algorithm (RL) called Q-learning to train a...
Recently, there have been several attempts to design multiagent Q-learning algorithms that learn equ...
In this paper, we address the problem of convergence to Nash equilibria in games with rewards that a...
Some game theory approaches to solve multiagent reinforce-ment learning in self play, i.e. when agen...
This paper introduces Correlated-Q (CE-Q) learning, a multiagent Q-learning algorithm based on the c...
This paper introduces Correlated-Q (CE-Q) learning, a multiagent Q-learning algorithm based on the c...
This paper describes an approach to rein-forcement learning in multiagent general-sum games in which...
Abstract—Learning processes that converge to mixed-strategy equilibria often exhibit learning only i...
This article investigates the performance of independent reinforcement learners in multi-agent games...
The paper studies the highly prototypical Fictitious Play (FP) algorithm, as well as a broad class o...
Hirsch [2], is called smooth fictitious play. Using techniques from stochastic approximation by the ...
The single-agent multi-armed bandit problem can be solved by an agent that learns the values of each...
Abstract The Nash equilibrium concept has previously been shown to be an important tool to understan...