AbstractWe present a new algorithm for polynomial time learning of optimal behavior in single-controller stochastic games. This algorithm incorporates and integrates important recent results of Kearns and Singh (Proc. ICML-98, 1998) in reinforcement learning and of Monderer and Tennenholtz (J. Artif. Intell. Res. 7, 1997, p. 231) in repeated games. In stochastic games, the agent must cope with the existence of an adversary whose actions can be arbitrary. In particular, this adversary can withhold information about the game matrix by refraining from (or rarely) performing certain actions. This forces upon us an exploration versus exploitation dilemma more complex than in Markov decision processes in which, given information about particular ...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...
In this paper, we give an overview of recently developed machine learning methods for stochastic con...
We consider reinforcement learning algorithms in normal form games. Using two-timescales stochastic ...
We present a new algorithm for polynomial time learning of optimal behavior in stochastic games. Thi...
Inspired by recent results on polynomial time reinforcement algorithms that accumulate near-optimal ...
We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on ...
Solving multi-agent reinforcement learning problems has proven difficult because of the lack of trac...
Learning to play in the presence of independent and self-motivated opponents is a difficult task, be...
AbstractWe present new algorithms for determining optimal strategies for two-player games with proba...
Learning behaviors in a multiagent environment is crucial for developing and adapting multiagent sys...
We survey a number of algorithms for the simple stochastic game problem, which is to determine the w...
We present a novel and uniform formulation of the problem of reinforcement learning against bounded ...
Game theory proved to be very useful in the field of verification of open reactive systems. This is ...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...
AbstractLearning to act in a multiagent environment is a difficult problem since the normal definiti...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...
In this paper, we give an overview of recently developed machine learning methods for stochastic con...
We consider reinforcement learning algorithms in normal form games. Using two-timescales stochastic ...
We present a new algorithm for polynomial time learning of optimal behavior in stochastic games. Thi...
Inspired by recent results on polynomial time reinforcement algorithms that accumulate near-optimal ...
We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on ...
Solving multi-agent reinforcement learning problems has proven difficult because of the lack of trac...
Learning to play in the presence of independent and self-motivated opponents is a difficult task, be...
AbstractWe present new algorithms for determining optimal strategies for two-player games with proba...
Learning behaviors in a multiagent environment is crucial for developing and adapting multiagent sys...
We survey a number of algorithms for the simple stochastic game problem, which is to determine the w...
We present a novel and uniform formulation of the problem of reinforcement learning against bounded ...
Game theory proved to be very useful in the field of verification of open reactive systems. This is ...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...
AbstractLearning to act in a multiagent environment is a difficult problem since the normal definiti...
In this paper we introduce a new multi-agent reinforcement learning algorithm, called exploring self...
In this paper, we give an overview of recently developed machine learning methods for stochastic con...
We consider reinforcement learning algorithms in normal form games. Using two-timescales stochastic ...