Neural networks are an important class of highly flexible and powerful models inspired by the structure of the brain. They consist of a sequence of interconnected layers, each comprised of basic computational units similar to the gates of a classical circuit. And like circuits, they have the capacity to perform simple computational procedures such as those which might underlie the generating process of the dataset they are trained on. The most popular and successful approach for learning neural networks is to optimize their parameters with respect to some objective function using standard methods for nonlinear optimization. Because basic methods like stochastic gradient descent (SGD) can often be very slow for deeply layered neural netwo...
Learning non use-case specific models has been shown to be a challenging task in Deep Learning (DL)....
Recently, we proposed to transform the outputs of each hidden neuron in a multi-layer perceptron net...
Understanding intelligence and how it allows humans to learn, to make decision and form memories, is...
Neural networks are an important class of highly flexible and powerful models inspired by the struct...
In this dissertation, we are concerned with the advancement of optimization algorithms for training ...
Second-order optimization methods applied to train deep neural net- works use the curvature informat...
Second-order optimization methods have the ability to accelerate convergence by modifying the gradie...
Second-order optimizers are thought to hold the potential to speed up neural network training, but d...
We propose a fast second-order method that can be used as a drop-in replacement for current deep lea...
Hessian-free (HF) optimization has been successfully used for training deep au-toencoders and recurr...
For a long time, second-order optimization methods have been regarded as computationally inefficient...
The stochastic gradient method is currently the prevailing technology for training neural networks. ...
Hessian-based analysis/computation is widely used in scientific computing. However, due to the (inco...
In the recent decade, deep neural networks have solved ever more complex tasks across many fronts in...
Learning a deep neural network requires solving a challenging optimization problem: it is a high-dim...
Learning non use-case specific models has been shown to be a challenging task in Deep Learning (DL)....
Recently, we proposed to transform the outputs of each hidden neuron in a multi-layer perceptron net...
Understanding intelligence and how it allows humans to learn, to make decision and form memories, is...
Neural networks are an important class of highly flexible and powerful models inspired by the struct...
In this dissertation, we are concerned with the advancement of optimization algorithms for training ...
Second-order optimization methods applied to train deep neural net- works use the curvature informat...
Second-order optimization methods have the ability to accelerate convergence by modifying the gradie...
Second-order optimizers are thought to hold the potential to speed up neural network training, but d...
We propose a fast second-order method that can be used as a drop-in replacement for current deep lea...
Hessian-free (HF) optimization has been successfully used for training deep au-toencoders and recurr...
For a long time, second-order optimization methods have been regarded as computationally inefficient...
The stochastic gradient method is currently the prevailing technology for training neural networks. ...
Hessian-based analysis/computation is widely used in scientific computing. However, due to the (inco...
In the recent decade, deep neural networks have solved ever more complex tasks across many fronts in...
Learning a deep neural network requires solving a challenging optimization problem: it is a high-dim...
Learning non use-case specific models has been shown to be a challenging task in Deep Learning (DL)....
Recently, we proposed to transform the outputs of each hidden neuron in a multi-layer perceptron net...
Understanding intelligence and how it allows humans to learn, to make decision and form memories, is...