Learning with opponent-learning awareness

Foerster, J
Chen, R
Al-Shedivat, M
Whiteson, S
Abbeel, P
Mordatch, I

Publication date

January 2018

Publisher

International Foundation for Autonomous Agents and Multiagent Systems

Abstract

Multi-agent settings are quickly gathering importance in machine learning. This includes a plethora of recent work on deep multi-agent reinforcement learning, but also can be extended to hierarchical reinforcement learning, generative adversarial networks and decentralised optimization. In all these settings the presence of multiple learning agents renders the training problem non-stationary and often leads to unstable training or undesired final results. We present Learning with Opponent-Learning Awareness (LOLA), a method in which each agent shapes the anticipated learning of the other agents in the environment. The LOLA learning rule includes an additional term that accounts for the impact of one agent’s policy on the anticipated paramet...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Learning with opponent-learning awareness

Abstract

Extracted data

Learning with opponent-learning awareness

Abstract

Extracted data

Related items

Related items