International audienceIn this paper we study a type of games regularized by the relative entropy, where the players' strategies are coupled through a random environment variable. Besides the existence and the uniqueness of equilibria of such games, we prove that the marginal laws of the corresponding mean-field Langevin systems can converge towards the games' equilibria in different settings. As applications, the dynamic games can be treated as games on a random environment when one treats the time horizon as the environment. In practice, our results can be applied to analysing the stochastic gradient descent algorithm for deep neural networks in the context of supervised learning as well as for the generative adversarial networks
Abstract. Starting from a heuristic learning scheme for strategic n-person games, we de-rive a new c...
In this paper, we study a regularised relaxed optimal control problem and, in particular, we are con...
Current distributed routing control algorithms for dynamic networks model networks using the time ev...
International audienceIn this paper, we study a class of games regularized by relative entropy where...
In this paper, we introduce a new class of game dynamics made of a pay-off replicator-like term modu...
We study a class of stochastic dynamic games that exhibit strategic complementarities between player...
We analyse the connection between Mean Field Games (MFGs) and a popular Machine Learning model, name...
We study maximum entropy correlated equilibria (Maxent CE) in multi-player games. After motivating a...
ABSTRACT. The purpose of this paper is to provide a complete probabilistic analysis of a large class...
The recent mean field game (MFG) formalism facilitates otherwise intractable computation of approxim...
The purpose of this paper is to provide a complete probabilistic analysis of a large class of stocha...
The purpose of this paper is to provide a complete probabilistic analysis of a large class of stocha...
International audienceWe consider a class of stochastic games with finite number of resource states,...
The collective behaviour of stochastic multi-agents swarms driven by Gaussian and non-Gaussian envir...
We present a probabilistic analysis of the long-time behaviour of the nonlocal, diffusive equations ...
Abstract. Starting from a heuristic learning scheme for strategic n-person games, we de-rive a new c...
In this paper, we study a regularised relaxed optimal control problem and, in particular, we are con...
Current distributed routing control algorithms for dynamic networks model networks using the time ev...
International audienceIn this paper, we study a class of games regularized by relative entropy where...
In this paper, we introduce a new class of game dynamics made of a pay-off replicator-like term modu...
We study a class of stochastic dynamic games that exhibit strategic complementarities between player...
We analyse the connection between Mean Field Games (MFGs) and a popular Machine Learning model, name...
We study maximum entropy correlated equilibria (Maxent CE) in multi-player games. After motivating a...
ABSTRACT. The purpose of this paper is to provide a complete probabilistic analysis of a large class...
The recent mean field game (MFG) formalism facilitates otherwise intractable computation of approxim...
The purpose of this paper is to provide a complete probabilistic analysis of a large class of stocha...
The purpose of this paper is to provide a complete probabilistic analysis of a large class of stocha...
International audienceWe consider a class of stochastic games with finite number of resource states,...
The collective behaviour of stochastic multi-agents swarms driven by Gaussian and non-Gaussian envir...
We present a probabilistic analysis of the long-time behaviour of the nonlocal, diffusive equations ...
Abstract. Starting from a heuristic learning scheme for strategic n-person games, we de-rive a new c...
In this paper, we study a regularised relaxed optimal control problem and, in particular, we are con...
Current distributed routing control algorithms for dynamic networks model networks using the time ev...