Restricted Boltzmann Machines (RBMs) are widely used as building blocks for deep learning models. Learning typically proceeds by using stochastic gradient descent, and the gradients are estimated with sampling methods. However, the gradient estimation is a computational bottleneck, so better use of the gradients will speed up the descent algorithm.To this end, we rst derive upper bounds on the RBM cost function, then show that descent methods can have natural advantages by operating in the `1 and Shatten-1 norm. We introduce a new method called \Stochastic Spectral Descent" that updates parameters in the normed space. Empirical results show dramatic improvements over stochastic gradient descent, and have only have a fractional increase on t...
International audienceThis text is the rejoinder following the discussion of a survey paper about mi...
AbstractIn this paper, we consider the solution of evolutionary Wente's problem with the wave operat...
This paper is concerned with the function $r_{k,s}(n)$, the number of (ordered) representations of $...
peer reviewedIn this paper, we study the behavior of pulse-coupled integrate-and-fire oscillators. E...
The vector difference equation ξk = Af(ξk−1)+εk, where (εk) is a square integrable difference marti...
Testing for the significance of a subset of regression coefficients in a linear model, a staple of s...
This paper analyzes applications of cumulant analysis in speech processing. A special focus is made...
AbstractA sufficient condition of stability of exponential Runge–Kutta methods for delay differentia...
AbstractThe numerical-analytic method is applied to a class of nonlinear differential-algebraic syst...
We correct two errors in our paper [4]. First error concerns the definition of the SVI solution, whe...
The aim of this paper is to solve the orthogonality equation with two unknown functions. This probl...
We extend the square of white noise algebra over the step functions on ℝ to the test function space ...
We give here a proof of the convergence of the Stochastic Gradient Descent (SGD) in a self-contained...
Approximation of some classes of random processes by cubic splines with given accuracy and reliabili...
In this paper we study a nonlinear evolution inclusion of subdifferential type in Hilbert spaces. T...
International audienceThis text is the rejoinder following the discussion of a survey paper about mi...
AbstractIn this paper, we consider the solution of evolutionary Wente's problem with the wave operat...
This paper is concerned with the function $r_{k,s}(n)$, the number of (ordered) representations of $...
peer reviewedIn this paper, we study the behavior of pulse-coupled integrate-and-fire oscillators. E...
The vector difference equation ξk = Af(ξk−1)+εk, where (εk) is a square integrable difference marti...
Testing for the significance of a subset of regression coefficients in a linear model, a staple of s...
This paper analyzes applications of cumulant analysis in speech processing. A special focus is made...
AbstractA sufficient condition of stability of exponential Runge–Kutta methods for delay differentia...
AbstractThe numerical-analytic method is applied to a class of nonlinear differential-algebraic syst...
We correct two errors in our paper [4]. First error concerns the definition of the SVI solution, whe...
The aim of this paper is to solve the orthogonality equation with two unknown functions. This probl...
We extend the square of white noise algebra over the step functions on ℝ to the test function space ...
We give here a proof of the convergence of the Stochastic Gradient Descent (SGD) in a self-contained...
Approximation of some classes of random processes by cubic splines with given accuracy and reliabili...
In this paper we study a nonlinear evolution inclusion of subdifferential type in Hilbert spaces. T...
International audienceThis text is the rejoinder following the discussion of a survey paper about mi...
AbstractIn this paper, we consider the solution of evolutionary Wente's problem with the wave operat...
This paper is concerned with the function $r_{k,s}(n)$, the number of (ordered) representations of $...