Neural SDEs combine many of the best qualities of both RNNs and SDEs: memory efficient training, high-capacity function approximation, and strong priors on model space. This makes them a natural choice for modelling many types of temporal dynamics. Training a Neural SDE (either as a VAE or as a GAN) requires backpropagating through an SDE solve. This may be done by solving a backwards-in-time SDE whose solution is the desired parameter gradients. However, this has previously suffered from severe speed and accuracy issues, due to high computational cost and numerical truncation errors. Here, we overcome these issues through several technical innovations. First, we introduce the reversible Heun method. This is a new SDE solver that is algebra...
Most machine learning methods are used as a black box for modelling. We may try to extract some know...
We investigate a new approach to compute the gradients of artificial neural networks (ANNs), based o...
We provide the first experimental results on non-synthetic datasets for the quasi-diagonal Riemannia...
Stochastic differential equations (SDEs) are a staple of mathematical modelling of temporal dynamics...
Neural SDEs with Brownian motion as noise lead to smoother attributions than traditional ResNets. Va...
The conjoining of dynamical systems and deep learning has become a topic of great interest. In parti...
Diffusion models have shown remarkable performance on many generative tasks. Despite recent success,...
We describe the neural-network training framework used in the Kaldi speech recogni-tion toolkit, whi...
Training Neural Ordinary Differential Equations (ODEs) is often computationally expensive. Indeed, c...
Deep neural networks currently play a prominent role in solving problems across a wide variety of di...
This paper proposes the Mesh Neural Network (MNN), a novel architecture which allows neurons to be c...
To enable learning on edge devices with fast convergence and low memory, we present a novel backprop...
Unresolved gradients produce numerical oscillations and inaccurate results. The most straightforward...
Recently, brain-inspired spiking neuron networks (SNNs) have attracted widespread research interest ...
On‐chip training of neural networks (NNs) is regarded as a promising training method for neuromorphi...
Most machine learning methods are used as a black box for modelling. We may try to extract some know...
We investigate a new approach to compute the gradients of artificial neural networks (ANNs), based o...
We provide the first experimental results on non-synthetic datasets for the quasi-diagonal Riemannia...
Stochastic differential equations (SDEs) are a staple of mathematical modelling of temporal dynamics...
Neural SDEs with Brownian motion as noise lead to smoother attributions than traditional ResNets. Va...
The conjoining of dynamical systems and deep learning has become a topic of great interest. In parti...
Diffusion models have shown remarkable performance on many generative tasks. Despite recent success,...
We describe the neural-network training framework used in the Kaldi speech recogni-tion toolkit, whi...
Training Neural Ordinary Differential Equations (ODEs) is often computationally expensive. Indeed, c...
Deep neural networks currently play a prominent role in solving problems across a wide variety of di...
This paper proposes the Mesh Neural Network (MNN), a novel architecture which allows neurons to be c...
To enable learning on edge devices with fast convergence and low memory, we present a novel backprop...
Unresolved gradients produce numerical oscillations and inaccurate results. The most straightforward...
Recently, brain-inspired spiking neuron networks (SNNs) have attracted widespread research interest ...
On‐chip training of neural networks (NNs) is regarded as a promising training method for neuromorphi...
Most machine learning methods are used as a black box for modelling. We may try to extract some know...
We investigate a new approach to compute the gradients of artificial neural networks (ANNs), based o...
We provide the first experimental results on non-synthetic datasets for the quasi-diagonal Riemannia...