Abstract. In this paper we present lessons learned in the Evaluating Predictive Uncertainty Challenge. We describe the methods we used in regression challenges, including our winning method for the Outaouais data set. We then turn our attention to the more general problem of scoring in probabilistic machine learning challenges. It is widely accepted that scoring rules should be proper in the sense that the true generative distribution has the best expected score; we note that while this is useful, it does not guarantee finding the best methods for practical machine learning tasks. We point out some problems in local scoring rules such as the negative logarithm of predictive density (NLPD), and illustrate with examples that many of these pro...
Quantitative characterizations and estimations of uncertainty are of fundamental importance for mach...
The evaluation of probabilistic forecasts plays a central role both in the interpretation and in the...
We propose and motivate an expanded version of the logarithmic score for forecasting distributions,...
This Chapter presents the PASCAL Evaluating Predictive Uncertainty Challenge, introduces the contrib...
There are several scoring rules that one can choose from in order to score probabilistic forecasting...
Questions remain regarding how the skill of operational probabilistic forecasts is most usefully eva...
Ascoring rule S(x; q) provides away of judging the quality of a quoted probability density q for a r...
<p> Probability forecasts play an important role in many decision and risk analysis applications. Re...
We construct a model of expert prediction where predictions can influence the state of the world. Un...
Calibration, the statistical consistency of forecast distributions and the observations, is a centra...
Predictions and forecasts of machine learning models should take the form of probability distributio...
A scoring rule is a device for eliciting and assessing probabilistic forecasts from an agent. When d...
Most of the methods nowadays employed in forecast problems are based on scoring rules. There is a di...
Scoring rules are an important tool for evaluating the performance of probabilistic forecasting sche...
Scoring rules can provide incentives for truthful reporting of probabilities and evaluation measures...
Quantitative characterizations and estimations of uncertainty are of fundamental importance for mach...
The evaluation of probabilistic forecasts plays a central role both in the interpretation and in the...
We propose and motivate an expanded version of the logarithmic score for forecasting distributions,...
This Chapter presents the PASCAL Evaluating Predictive Uncertainty Challenge, introduces the contrib...
There are several scoring rules that one can choose from in order to score probabilistic forecasting...
Questions remain regarding how the skill of operational probabilistic forecasts is most usefully eva...
Ascoring rule S(x; q) provides away of judging the quality of a quoted probability density q for a r...
<p> Probability forecasts play an important role in many decision and risk analysis applications. Re...
We construct a model of expert prediction where predictions can influence the state of the world. Un...
Calibration, the statistical consistency of forecast distributions and the observations, is a centra...
Predictions and forecasts of machine learning models should take the form of probability distributio...
A scoring rule is a device for eliciting and assessing probabilistic forecasts from an agent. When d...
Most of the methods nowadays employed in forecast problems are based on scoring rules. There is a di...
Scoring rules are an important tool for evaluating the performance of probabilistic forecasting sche...
Scoring rules can provide incentives for truthful reporting of probabilities and evaluation measures...
Quantitative characterizations and estimations of uncertainty are of fundamental importance for mach...
The evaluation of probabilistic forecasts plays a central role both in the interpretation and in the...
We propose and motivate an expanded version of the logarithmic score for forecasting distributions,...