In the last decade or so, deep learning has revolutionized entire domains of machine learning. Neural networks have helped achieve significant improvements in computer vision, machine translation, speech recognition, etc. These powerful empirical demonstrations leave a wide gap between our current theoretical understanding of neural networks and their practical performance. The theoretical questions in deep learning can be put under three broad but inter-related themes: 1) Architecture/Representation, 2) Optimization, and 3) Generalization. In this dissertation, we study the landscapes of different deep learning problems to answer questions in the above themes. First, in order to understand what representations can be learned by neural netw...
Despite the fact that the loss functions of deep neural networks are highly non-convex, gradient-bas...
As deep learning has become solution for various machine learning, artificial intelligence applicati...
It has been empirically observed in deep learning that the training problem of deep over-parameteriz...
With a direct analysis of neural networks, this paper presents a mathematically tight generalization...
Deep learning has transformed computer vision, natural language processing, and speech recognition. ...
In recent years Deep Neural Networks (DNNs) have achieved state-of-the-art results in many fields su...
The generalization mystery in deep learning is the following: Why do over-parameterized neural netwo...
The general features of the optimization problem for the case of overparametrized nonlinear networks...
How to train deep neural networks (DNNs) to generalize well is a central concern in deep learning, e...
This paper provides theoretical insights into why and how deep learning can generalize well, despite...
A main puzzle of deep networks revolves around the absence of overfitting despite overparametrizatio...
Despite the fact that the loss functions of deep neural networks are highly non-convex, gradient-bas...
The success of deep learning has revealed the application potential of neural networks across the sc...
Across scientific and engineering disciplines, the algorithmic pipeline forprocessing and understand...
The success of deep learning has revealed the application potential of neural networks across the sc...
Despite the fact that the loss functions of deep neural networks are highly non-convex, gradient-bas...
As deep learning has become solution for various machine learning, artificial intelligence applicati...
It has been empirically observed in deep learning that the training problem of deep over-parameteriz...
With a direct analysis of neural networks, this paper presents a mathematically tight generalization...
Deep learning has transformed computer vision, natural language processing, and speech recognition. ...
In recent years Deep Neural Networks (DNNs) have achieved state-of-the-art results in many fields su...
The generalization mystery in deep learning is the following: Why do over-parameterized neural netwo...
The general features of the optimization problem for the case of overparametrized nonlinear networks...
How to train deep neural networks (DNNs) to generalize well is a central concern in deep learning, e...
This paper provides theoretical insights into why and how deep learning can generalize well, despite...
A main puzzle of deep networks revolves around the absence of overfitting despite overparametrizatio...
Despite the fact that the loss functions of deep neural networks are highly non-convex, gradient-bas...
The success of deep learning has revealed the application potential of neural networks across the sc...
Across scientific and engineering disciplines, the algorithmic pipeline forprocessing and understand...
The success of deep learning has revealed the application potential of neural networks across the sc...
Despite the fact that the loss functions of deep neural networks are highly non-convex, gradient-bas...
As deep learning has become solution for various machine learning, artificial intelligence applicati...
It has been empirically observed in deep learning that the training problem of deep over-parameteriz...