The highly non-linear nature of deep neural networks causes them to be susceptible to adversarial examples and have unstable gradients which hinders interpretability. However, existing methods to solve these issues, such as adversarial training, are expensive and often sacrifice predictive accuracy. In this work, we consider curvature, which is a mathematical quantity which encodes the degree of non-linearity. Using this, we demonstrate low-curvature neural networks (LCNNs) that obtain drastically lower curvature than standard models while exhibiting similar predictive performance, which leads to improved robustness and stable gradients, with only a marginally increased training time. To achieve this, we minimize a data-independent upper ...
Littmann E, Ritter H. Curvature estimation with a DCA neural network. In: Deutsche Arbeitsgemeinscha...
Neural networks are an important class of highly flexible and powerful models inspired by the struct...
International audienceDeep neural networks of sizes commonly encountered in practice are proven to c...
© 2017 IEEE. Training deep neural networks is difficult for the pathological curvature problem. Re-p...
In the recent decade, deep neural networks have solved ever more complex tasks across many fronts in...
Solving large scale optimization problems, such as neural networks training, can present many challe...
Second-order optimization methods applied to train deep neural net- works use the curvature informat...
We present an error-neural-modeling-based strategy for approximating two-dimensional curvature in th...
The success of deep learning has revealed the application potential of neural networks across the sc...
The performance of feed-forward neural networks in real applications can be often be improved signif...
Across scientific and engineering disciplines, the algorithmic pipeline forprocessing and understand...
International audienceGraph Neural Networks (GNNs) have succeeded in various computer science applic...
When training overparameterized deep networks for classification tasks, it has been widely observed ...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Brain and Cognitive Sciences, 2...
In this dissertation, we are concerned with the advancement of optimization algorithms for training ...
Littmann E, Ritter H. Curvature estimation with a DCA neural network. In: Deutsche Arbeitsgemeinscha...
Neural networks are an important class of highly flexible and powerful models inspired by the struct...
International audienceDeep neural networks of sizes commonly encountered in practice are proven to c...
© 2017 IEEE. Training deep neural networks is difficult for the pathological curvature problem. Re-p...
In the recent decade, deep neural networks have solved ever more complex tasks across many fronts in...
Solving large scale optimization problems, such as neural networks training, can present many challe...
Second-order optimization methods applied to train deep neural net- works use the curvature informat...
We present an error-neural-modeling-based strategy for approximating two-dimensional curvature in th...
The success of deep learning has revealed the application potential of neural networks across the sc...
The performance of feed-forward neural networks in real applications can be often be improved signif...
Across scientific and engineering disciplines, the algorithmic pipeline forprocessing and understand...
International audienceGraph Neural Networks (GNNs) have succeeded in various computer science applic...
When training overparameterized deep networks for classification tasks, it has been widely observed ...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Brain and Cognitive Sciences, 2...
In this dissertation, we are concerned with the advancement of optimization algorithms for training ...
Littmann E, Ritter H. Curvature estimation with a DCA neural network. In: Deutsche Arbeitsgemeinscha...
Neural networks are an important class of highly flexible and powerful models inspired by the struct...
International audienceDeep neural networks of sizes commonly encountered in practice are proven to c...