This paper introduces a novel payoff-based learning scheme for distributed optimization in repeatedly-played strategic-form games. Standard reinforcement-based learning exhibits several limitations with respect to their asymptotic stability. For example, in two-player coordination games, payoff-dominant (or efficient) Nash equilibria may not be stochastically stable. In this work, we present an extension of perturbed learning automata, namely aspiration-based perturbed learning automata (APLA) that overcomes these limitations. We provide a stochastic stability analysis in multi-player coordination games. In the case of two-player coordination games, we show that the payoff-dominant Nash equilibrium is the unique stochastically stable state
We propose a multi-agent reinforcement learning dynamics, and analyze its convergence properties in ...
This article investigates the performance of independent reinforcement learners in multi-agent games...
This paper provides a genera1 framework to analyze rational learning in strategic situations where t...
This paper considers a class of reinforcement-based learning (namely, perturbed learning automata) a...
This paper considers a class of reinforcement-learning that belongs to the family of Learning Automa...
In this paper, we address the problem of convergence to Nash equilibria in games with rewards that a...
We propose a simple payoff-based learning rule that is completely decentralized, and that leads to a...
We propose a simple payoff-based learning rule that is completely decentralized, and that leads to a...
International audienceStarting from a heuristic learning scheme for N-person games, we derive a new ...
This paper considers a multi-person discrete game with random payoffs. The distribution of the rando...
We consider the ability of economic agents to learn in a decentralized envi-ronment in which agents ...
Fudenberg and Kreps (1993) consider adaptive learning processes, in the spirit of ctitious play, for...
We investigate the stability of mixed strategy equilibria in 2 person (bimatrix) games under perturb...
International audienceMotivated by the scarcity of accurate payoff feedback in practical application...
International audienceWhile payoff-based learning models are almost exclusively devised for finite a...
We propose a multi-agent reinforcement learning dynamics, and analyze its convergence properties in ...
This article investigates the performance of independent reinforcement learners in multi-agent games...
This paper provides a genera1 framework to analyze rational learning in strategic situations where t...
This paper considers a class of reinforcement-based learning (namely, perturbed learning automata) a...
This paper considers a class of reinforcement-learning that belongs to the family of Learning Automa...
In this paper, we address the problem of convergence to Nash equilibria in games with rewards that a...
We propose a simple payoff-based learning rule that is completely decentralized, and that leads to a...
We propose a simple payoff-based learning rule that is completely decentralized, and that leads to a...
International audienceStarting from a heuristic learning scheme for N-person games, we derive a new ...
This paper considers a multi-person discrete game with random payoffs. The distribution of the rando...
We consider the ability of economic agents to learn in a decentralized envi-ronment in which agents ...
Fudenberg and Kreps (1993) consider adaptive learning processes, in the spirit of ctitious play, for...
We investigate the stability of mixed strategy equilibria in 2 person (bimatrix) games under perturb...
International audienceMotivated by the scarcity of accurate payoff feedback in practical application...
International audienceWhile payoff-based learning models are almost exclusively devised for finite a...
We propose a multi-agent reinforcement learning dynamics, and analyze its convergence properties in ...
This article investigates the performance of independent reinforcement learners in multi-agent games...
This paper provides a genera1 framework to analyze rational learning in strategic situations where t...