We present a probabilistic analysis of the long-time behaviour of the nonlocal, diffusive equations with a gradient flow structure in 2-Wasserstein metric, namely, the Mean-Field Langevin Dynamics (MFLD). Our work is motivated by a desire to provide a theoretical underpinning for the convergence of stochastic gradient type algorithms widely used for non-convex learning tasks such as training of deep neural networks. The key insight is that the certain class of the finite dimensional non-convex problems becomes convex when lifted to infinite dimensional space of measures. We leverage this observation and show that the corresponding energy functional defined on the space of probability measures has a unique minimiser which can be characterise...
International audienceIn this paper, we study a class of games regularized by relative entropy where...
International audienceIn this paper we study a type of games regularized by the relative entropy, wh...
Two independent subjects are studied in this thesis, the first of which consists of two distinct pro...
We present a probabilistic analysis of the long-time behaviour of the nonlocal, diffusive equations ...
We analyze in a closed form the learning dynamics of the stochastic gradient descent (SGD) for a sin...
8 pages + appendix, 4 figuresInternational audienceWe analyze in a closed form the learning dynamics...
We study the global convergence of policy gradient for infinite-horizon, continuous state and action...
In this paper, we study a regularised relaxed optimal control problem and, in particular, we are con...
Nowadays neural networks are a powerful tool, even if there are few mathematical results that explai...
Effective training of deep neural networks suffers from two main issues. The first is that the param...
Stochastic-gradient sampling methods are often used to perform Bayesian inference on neural networks...
Best Paper AwardInternational audienceOne way to avoid overfitting in machine learning is to use mod...
We study the problem of policy optimization for infinite-horizon discounted Markov Decision Process...
Despite the non-convex optimization landscape, over-parametrized shallow networks are able to achiev...
Machine learning, and in particular neural network models, have revolutionized fields such as image,...
International audienceIn this paper, we study a class of games regularized by relative entropy where...
International audienceIn this paper we study a type of games regularized by the relative entropy, wh...
Two independent subjects are studied in this thesis, the first of which consists of two distinct pro...
We present a probabilistic analysis of the long-time behaviour of the nonlocal, diffusive equations ...
We analyze in a closed form the learning dynamics of the stochastic gradient descent (SGD) for a sin...
8 pages + appendix, 4 figuresInternational audienceWe analyze in a closed form the learning dynamics...
We study the global convergence of policy gradient for infinite-horizon, continuous state and action...
In this paper, we study a regularised relaxed optimal control problem and, in particular, we are con...
Nowadays neural networks are a powerful tool, even if there are few mathematical results that explai...
Effective training of deep neural networks suffers from two main issues. The first is that the param...
Stochastic-gradient sampling methods are often used to perform Bayesian inference on neural networks...
Best Paper AwardInternational audienceOne way to avoid overfitting in machine learning is to use mod...
We study the problem of policy optimization for infinite-horizon discounted Markov Decision Process...
Despite the non-convex optimization landscape, over-parametrized shallow networks are able to achiev...
Machine learning, and in particular neural network models, have revolutionized fields such as image,...
International audienceIn this paper, we study a class of games regularized by relative entropy where...
International audienceIn this paper we study a type of games regularized by the relative entropy, wh...
Two independent subjects are studied in this thesis, the first of which consists of two distinct pro...