This paper proposes a set of new error criteria and a learning approach, called Adaptive Normalized Risk-Averting Training (ANRAT) to attack the non-convex optimization problem in training deep neural networks without pretraining. Theoretically, we demonstrate its effectiveness based on the expansion of the convexity region. By analyzing the gradient on the convexity index $\lambda$, we explain the reason why our learning method using gradient descent works. In practice, we show how this training method is successfully applied for improved training of deep neural networks to solve visual recognition tasks on the MNIST and CIFAR-10 datasets. Using simple experimental settings without pretraining and other tricks, we obtain results comparable...
The success of deep neural networks is in part due to the use of normalization layers. Normalization...
Despite powerful representation ability, deep neural networks (DNNs) are prone to over-fitting, beca...
Deep neural networks have become the state-of-the-art tool to solve many computer vision problems. H...
© 2017 IEEE. Training deep neural networks is difficult for the pathological curvature problem. Re-p...
International audienceWe introduce a general framework for designing and training neural network lay...
We present DANTE, a novel method for training neural networks using the alternating minimization pri...
In this thesis, we theoretically analyze the ability of neural networks trained by gradient descent ...
Optimization is the key component of deep learning. Increasing depth, which is vital for reaching a...
In the recent decade, deep neural networks have solved ever more complex tasks across many fronts in...
Training neural networks is a challenging non-convex optimization problem, and backpropagation or gr...
Deep Learning has become interestingly popular in the field of computer vision, mostly attaining ne...
This paper proposes a new family of algorithms for training neural networks (NNs). These...
Batch Normalization (BN) is an essential component of the Deep Neural Networks (DNNs) architectures....
Recently smoothing deep neural network based classifiers via isotropic Gaussian perturbation is show...
Convexity has recently received a lot of attention in the machine learning community, and the lack o...
The success of deep neural networks is in part due to the use of normalization layers. Normalization...
Despite powerful representation ability, deep neural networks (DNNs) are prone to over-fitting, beca...
Deep neural networks have become the state-of-the-art tool to solve many computer vision problems. H...
© 2017 IEEE. Training deep neural networks is difficult for the pathological curvature problem. Re-p...
International audienceWe introduce a general framework for designing and training neural network lay...
We present DANTE, a novel method for training neural networks using the alternating minimization pri...
In this thesis, we theoretically analyze the ability of neural networks trained by gradient descent ...
Optimization is the key component of deep learning. Increasing depth, which is vital for reaching a...
In the recent decade, deep neural networks have solved ever more complex tasks across many fronts in...
Training neural networks is a challenging non-convex optimization problem, and backpropagation or gr...
Deep Learning has become interestingly popular in the field of computer vision, mostly attaining ne...
This paper proposes a new family of algorithms for training neural networks (NNs). These...
Batch Normalization (BN) is an essential component of the Deep Neural Networks (DNNs) architectures....
Recently smoothing deep neural network based classifiers via isotropic Gaussian perturbation is show...
Convexity has recently received a lot of attention in the machine learning community, and the lack o...
The success of deep neural networks is in part due to the use of normalization layers. Normalization...
Despite powerful representation ability, deep neural networks (DNNs) are prone to over-fitting, beca...
Deep neural networks have become the state-of-the-art tool to solve many computer vision problems. H...