We consider a repeated Prisoner’s Dilemma game where two independent learning agents play against each other. We assume that the players can observe each others ’ action but are oblivious to the payoff received by the other player. Multiagent learning literature has provided mechanisms that allow agents to converge to Nash Equilibrium. In this paper we define a special class of learner called a conditional joint action learner (CJAL) who attempts to learn the conditional probabil-ity of an action taken by the other given its own action and uses it to decide its next course of action. We prove that when played against itself, if the payoff structure of Prisoner’s Dilemma game satisfies certain conditions, using a limited exploration techniqu...
A number of experimental studies have investigated whether cooperative behavior may emerge in multi-...
We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-...
This paper addresses a mathematically tractable model of the Prisoner's Dilemma using the framework ...
We consider a repeated Prisoner’s Dilemma game where two independent learning agents play against ea...
Learning in the real world occurs when an agent, which perceives its current state and takes actions...
Learning to converge to an efficient, i.e., Pareto-optimal Nash equilibrium of the repeated game is ...
This paper studies the learning process carried out by two agents who are involved in many games. As...
Reinforcement learning can provide a robust and natural means for agents to learn how to coordinate ...
Multi-agent learning literature has looked at iterated twoplayer games to develop mechanisms that al...
Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents ’ ...
In this paper (reinforcement) learning of decision makers that face many different games is studied....
Repeated play in games by simple adaptive agents is investigated. The agents use Q-learning, a speci...
The paper studies infinite repetition of finite strategic form games. Players use a learning behavio...
The Nash equilibrium, the main solution concept in analytical game theory, cannot make precise predi...
Multi-agent settings are quickly gathering importance in machine learning. This includes a plethora ...
A number of experimental studies have investigated whether cooperative behavior may emerge in multi-...
We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-...
This paper addresses a mathematically tractable model of the Prisoner's Dilemma using the framework ...
We consider a repeated Prisoner’s Dilemma game where two independent learning agents play against ea...
Learning in the real world occurs when an agent, which perceives its current state and takes actions...
Learning to converge to an efficient, i.e., Pareto-optimal Nash equilibrium of the repeated game is ...
This paper studies the learning process carried out by two agents who are involved in many games. As...
Reinforcement learning can provide a robust and natural means for agents to learn how to coordinate ...
Multi-agent learning literature has looked at iterated twoplayer games to develop mechanisms that al...
Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents ’ ...
In this paper (reinforcement) learning of decision makers that face many different games is studied....
Repeated play in games by simple adaptive agents is investigated. The agents use Q-learning, a speci...
The paper studies infinite repetition of finite strategic form games. Players use a learning behavio...
The Nash equilibrium, the main solution concept in analytical game theory, cannot make precise predi...
Multi-agent settings are quickly gathering importance in machine learning. This includes a plethora ...
A number of experimental studies have investigated whether cooperative behavior may emerge in multi-...
We describe a generalized Q-learning type algorithm for reinforcement learning in competitive multi-...
This paper addresses a mathematically tractable model of the Prisoner's Dilemma using the framework ...