Predictive distributions quantify uncertainties ignored by point estimates. This paper introduces The Neural Testbed: an open-source benchmark for controlled and principled evaluation of agents that generate such predictions. Crucially, the testbed assesses agents not only on the quality of their marginal predictions per input, but also on their joint predictions across many inputs. We evaluate a range of agents using a simple neural network data generating process. Our results indicate that some popular Bayesian deep learning agents do not fare well with joint predictions, even when they can produce accurate marginal predictions. We also show that the quality of joint predictions drives performance in downstream decision tasks. We find the...
Deep neural networks (DNNs) have surpassed human-level accuracy in various learning tasks. However, ...
The ability to output accurate predictive uncertainty estimates is vital to a reliable classifier. S...
Accurate uncertainty quantification is necessary to enhance the reliability of deep learning models ...
A fundamental challenge for any intelligent system is prediction: given some inputs, can you predict...
Modern deep learning methods constitute incredibly powerful tools to tackle a myriad of challenging ...
The breakout success of deep neural networks (NNs) in the 2010's marked a new era in the quest to bu...
Modern deep learning methods constitute incredibly powerful tools to tackle a myriad of challenging ...
Designing uncertainty-aware deep learning models which are able to provide reasonable uncertainties ...
The 27th AIAI Irish Conference on Artificial Intelligence and Cognitive Science (AICS 2019) Galway, ...
Neural networks predictions are unreliable when the input sample is out of the training distribution...
Deep Ensembles, as a type of Bayesian Neural Networks, can be used to estimate uncertainty on the pr...
Intelligence relies on an agent's knowledge of what it does not know. This capability can be assesse...
Deep neural networks (DNNs) have surpassed human-level accuracy in various fields, including object ...
Deep learning models have shown promising results in areas including computer vision, natural langua...
Estimating the uncertainty in deep neural network predictions is crucial for many real-world applica...
Deep neural networks (DNNs) have surpassed human-level accuracy in various learning tasks. However, ...
The ability to output accurate predictive uncertainty estimates is vital to a reliable classifier. S...
Accurate uncertainty quantification is necessary to enhance the reliability of deep learning models ...
A fundamental challenge for any intelligent system is prediction: given some inputs, can you predict...
Modern deep learning methods constitute incredibly powerful tools to tackle a myriad of challenging ...
The breakout success of deep neural networks (NNs) in the 2010's marked a new era in the quest to bu...
Modern deep learning methods constitute incredibly powerful tools to tackle a myriad of challenging ...
Designing uncertainty-aware deep learning models which are able to provide reasonable uncertainties ...
The 27th AIAI Irish Conference on Artificial Intelligence and Cognitive Science (AICS 2019) Galway, ...
Neural networks predictions are unreliable when the input sample is out of the training distribution...
Deep Ensembles, as a type of Bayesian Neural Networks, can be used to estimate uncertainty on the pr...
Intelligence relies on an agent's knowledge of what it does not know. This capability can be assesse...
Deep neural networks (DNNs) have surpassed human-level accuracy in various fields, including object ...
Deep learning models have shown promising results in areas including computer vision, natural langua...
Estimating the uncertainty in deep neural network predictions is crucial for many real-world applica...
Deep neural networks (DNNs) have surpassed human-level accuracy in various learning tasks. However, ...
The ability to output accurate predictive uncertainty estimates is vital to a reliable classifier. S...
Accurate uncertainty quantification is necessary to enhance the reliability of deep learning models ...