Isotropic Gaussian priors are the de facto standard for modern Bayesian neural network inference. However, such simplistic priors are unlikely to either accurately reflect our true beliefs about the weight distributions, or to give optimal performance. We study summary statistics of neural network weights in different networks trained using SGD. We find that fully connected networks (FCNNs) display heavy-tailed weight distributions, while convolutional neural network (CNN) weights display strong spatial correlations. Building these observations into the respective priors leads to improved performance on a variety of image classification datasets. Moreover, we find that these priors also mitigate the cold posterior effect in FCNNs, while in ...
Bayesian neural networks have shown great promise in many applications where calibrated uncertainty ...
Existing Bayesian treatments of neural networks are typically characterized by weak prior and approx...
Understanding the relationship between connectionist and probabilistic models is important for evalu...
Deep neural networks have bested notable benchmarks across computer vision, reinforcement learning, ...
International audienceWe investigate deep Bayesian neural networks with Gaussian priors on the weigh...
Bayesian inference is known to provide a general framework for incorporating prior knowledge or spec...
International audienceWe investigate deep Bayesian neural networks with Gaussian priors on the weigh...
We investigate deep Bayesian neural networks with Gaussian priors on the weights and ReLU-like nonli...
10 pages, 5 figures, ICML'19 conferenceInternational audienceWe investigate deep Bayesian neural net...
The Bayesian treatment of neural networks dictates that a prior distribution is specified over their...
Stochastic variational inference for Bayesian deep neural network (DNN) requires specifying priors a...
| openaire: EC/H2020/101016775/EU//INTERVENEEncoding domain knowledge into the prior over the high-d...
The paper deals with learning probability distributions of observed data by artificial neural networ...
In recent years, Neural Networks (NN) have become a popular data-analytic tool in Statistics, Compu...
This paper studies the novel concept of weight correlation in deep neural networks and discusses its...
Bayesian neural networks have shown great promise in many applications where calibrated uncertainty ...
Existing Bayesian treatments of neural networks are typically characterized by weak prior and approx...
Understanding the relationship between connectionist and probabilistic models is important for evalu...
Deep neural networks have bested notable benchmarks across computer vision, reinforcement learning, ...
International audienceWe investigate deep Bayesian neural networks with Gaussian priors on the weigh...
Bayesian inference is known to provide a general framework for incorporating prior knowledge or spec...
International audienceWe investigate deep Bayesian neural networks with Gaussian priors on the weigh...
We investigate deep Bayesian neural networks with Gaussian priors on the weights and ReLU-like nonli...
10 pages, 5 figures, ICML'19 conferenceInternational audienceWe investigate deep Bayesian neural net...
The Bayesian treatment of neural networks dictates that a prior distribution is specified over their...
Stochastic variational inference for Bayesian deep neural network (DNN) requires specifying priors a...
| openaire: EC/H2020/101016775/EU//INTERVENEEncoding domain knowledge into the prior over the high-d...
The paper deals with learning probability distributions of observed data by artificial neural networ...
In recent years, Neural Networks (NN) have become a popular data-analytic tool in Statistics, Compu...
This paper studies the novel concept of weight correlation in deep neural networks and discusses its...
Bayesian neural networks have shown great promise in many applications where calibrated uncertainty ...
Existing Bayesian treatments of neural networks are typically characterized by weak prior and approx...
Understanding the relationship between connectionist and probabilistic models is important for evalu...