We show that a GPT-3 model can learn to express uncertainty about its own answers in natural language -- without use of model logits. When given a question, the model generates both an answer and a level of confidence (e.g. "90% confidence" or "high confidence"). These levels map to probabilities that are well calibrated. The model also remains moderately calibrated under distribution shift, and is sensitive to uncertainty in its own answers, rather than imitating human examples. To our knowledge, this is the first time a model has been shown to express calibrated uncertainty about its own answers in natural language. For testing calibration, we introduce the CalibratedMath suite of tasks. We compare the calibration of uncertainty expressed...
This paper introduces Bayesian uncertainty modeling using Stochastic Weight Averaging-Gaussian (SWAG...
International audienceLearning in a stochastic environment consists of estimating a model from a lim...
A learning procedure takes as input a dataset and performs inference for the parameters θ of a model...
With the recent success of deep learning methods, neural-based models have achieved superior perform...
Despite recent advances in statistical machine learning that significantly improve performance, the ...
We investigate the problem of determining the predictive confidence (or, conversely, uncertainty) of...
Reliable uncertainty quantification is a first step towards building explainable, transparent, and a...
The increased deployment of LMs for real-world tasks involving knowledge and facts makes it importan...
International audienceMachine Learning models can output confident but incorrect predictions. To add...
Predictive uncertainty estimation of pre-trained language models is an important measure of how like...
Detecting speculative assertions is essential to distinguish seman-tically uncertain information fro...
Both uncertainty estimation and interpretability are important factors for trustworthy machine learn...
International audienceDesigning approaches able to automatically detect uncertain expressions within...
Multi-class classification methods that produce sets of probabilistic classifiers, such as ensemble ...
Language Models are being widely used in Education. Even though modern deep learning models achieve ...
This paper introduces Bayesian uncertainty modeling using Stochastic Weight Averaging-Gaussian (SWAG...
International audienceLearning in a stochastic environment consists of estimating a model from a lim...
A learning procedure takes as input a dataset and performs inference for the parameters θ of a model...
With the recent success of deep learning methods, neural-based models have achieved superior perform...
Despite recent advances in statistical machine learning that significantly improve performance, the ...
We investigate the problem of determining the predictive confidence (or, conversely, uncertainty) of...
Reliable uncertainty quantification is a first step towards building explainable, transparent, and a...
The increased deployment of LMs for real-world tasks involving knowledge and facts makes it importan...
International audienceMachine Learning models can output confident but incorrect predictions. To add...
Predictive uncertainty estimation of pre-trained language models is an important measure of how like...
Detecting speculative assertions is essential to distinguish seman-tically uncertain information fro...
Both uncertainty estimation and interpretability are important factors for trustworthy machine learn...
International audienceDesigning approaches able to automatically detect uncertain expressions within...
Multi-class classification methods that produce sets of probabilistic classifiers, such as ensemble ...
Language Models are being widely used in Education. Even though modern deep learning models achieve ...
This paper introduces Bayesian uncertainty modeling using Stochastic Weight Averaging-Gaussian (SWAG...
International audienceLearning in a stochastic environment consists of estimating a model from a lim...
A learning procedure takes as input a dataset and performs inference for the parameters θ of a model...