Deep artificial neural networks achieve surprising generalization abilities that remain poorly understood. In this paper, we present a new approach to analyzing generalization for deep feed-forward ReLU networks that takes advantage of the degree of sparsity that is achieved in the hidden layer activations. By developing a framework that accounts for this reduced effective model size for each input sample, we are able to show fundamental trade-offs between sparsity and generalization. Importantly, our results make no strong assumptions about the degree of sparsity achieved by the model, and it improves over recent norm-based approaches. We illustrate our results numerically, demonstrating non-vacuous bounds when coupled with data-dependent ...
In the last decade or so, deep learning has revolutionized entire domains of machine learning. Neura...
It is well-known that modern neural networks are vulnerable to adversarial examples. To mitigate thi...
Deep learning is finding its way into the embedded world with applications such as autonomous drivin...
With a direct analysis of neural networks, this paper presents a mathematically tight generalization...
Empirical studies show that gradient-based methods can learn deep neural networks (DNNs) with very g...
Empirical studies show that gradient-based methods can learn deep neural networks (DNNs) with very g...
Deep learning has transformed computer vision, natural language processing, and speech recognition. ...
The growing energy and performance costs of deep learning have driven the community to reduce the si...
How to train deep neural networks (DNNs) to generalize well is a central concern in deep learning, e...
One of the important challenges today in deep learning is explaining the outstanding power of genera...
One of the important challenges today in deep learning is explaining the outstanding power of genera...
By making assumptions on the probability distribution of the potentials in a feed-forward neural net...
In this work, we construct generalization bounds to understand existing learning algorithms and prop...
We focus on a specific class of shallow neural networks with a single hidden layer, namely those wit...
Large neural networks are very successful in various tasks. However, with limited data, the generali...
In the last decade or so, deep learning has revolutionized entire domains of machine learning. Neura...
It is well-known that modern neural networks are vulnerable to adversarial examples. To mitigate thi...
Deep learning is finding its way into the embedded world with applications such as autonomous drivin...
With a direct analysis of neural networks, this paper presents a mathematically tight generalization...
Empirical studies show that gradient-based methods can learn deep neural networks (DNNs) with very g...
Empirical studies show that gradient-based methods can learn deep neural networks (DNNs) with very g...
Deep learning has transformed computer vision, natural language processing, and speech recognition. ...
The growing energy and performance costs of deep learning have driven the community to reduce the si...
How to train deep neural networks (DNNs) to generalize well is a central concern in deep learning, e...
One of the important challenges today in deep learning is explaining the outstanding power of genera...
One of the important challenges today in deep learning is explaining the outstanding power of genera...
By making assumptions on the probability distribution of the potentials in a feed-forward neural net...
In this work, we construct generalization bounds to understand existing learning algorithms and prop...
We focus on a specific class of shallow neural networks with a single hidden layer, namely those wit...
Large neural networks are very successful in various tasks. However, with limited data, the generali...
In the last decade or so, deep learning has revolutionized entire domains of machine learning. Neura...
It is well-known that modern neural networks are vulnerable to adversarial examples. To mitigate thi...
Deep learning is finding its way into the embedded world with applications such as autonomous drivin...