We formalize a framework for quantitatively assessing agreement between two datasets that are assumed to come from two distinct data generating mechanisms. We propose a methodology for prediction scoring which provides a measure of the distance between two unobserved data generating mechanisms (DGMs), along the dimension of a particular model. The cross-validated scores can be used to evaluate preregistered hypotheses and to perform model validation in the face of complex statistical models. Using human behavior data from the Next Generation Social Science (NGS2) program, we demonstrate that prediction scores can be used as model assessment tools and that they can reveal insights based on data collected from different populations and across...
The veracity of scientific claims is not always certain. In fact, sufficient claims have been proven...
<p>In this toy model, we try predict a variable from an uncorrelated predictor. The predictive power...
The goal of science is to accumulate knowledge that answers questions such as How do things work? ...
Evaluation of predictive models is a ubiquitous task in machine learning and data mining. Cross-vali...
The introduction of new “machine learning” methods and terminology to political science complicates ...
We measure how accurately replication of experimental results can be predicted by black-box statisti...
Motivation: Numerous competing algorithms for prediction in high-dimensional settings have been deve...
We measure how accurately replication of experimental results can be predicted by black-box statisti...
Data analysis usually aims to identify a particular signal, such as an intervention effect. Conventi...
Cross-validation is the process of comparing a model’s predictions to data that were not used in the...
Recent controversies about the level of replicability of behavioral research analyzed using statisti...
In applied statistics, tools from machine learning are popular for analyzing complex and high-dimens...
Cross-validation, an economical method for assessing whether sample results will generalize, is disc...
We consider comparisons of statistical learning algorithms using multiple data sets, via leave-one-i...
Do scientific claims, based on systematic observations, mean they are compulsorily true? Some empiri...
The veracity of scientific claims is not always certain. In fact, sufficient claims have been proven...
<p>In this toy model, we try predict a variable from an uncorrelated predictor. The predictive power...
The goal of science is to accumulate knowledge that answers questions such as How do things work? ...
Evaluation of predictive models is a ubiquitous task in machine learning and data mining. Cross-vali...
The introduction of new “machine learning” methods and terminology to political science complicates ...
We measure how accurately replication of experimental results can be predicted by black-box statisti...
Motivation: Numerous competing algorithms for prediction in high-dimensional settings have been deve...
We measure how accurately replication of experimental results can be predicted by black-box statisti...
Data analysis usually aims to identify a particular signal, such as an intervention effect. Conventi...
Cross-validation is the process of comparing a model’s predictions to data that were not used in the...
Recent controversies about the level of replicability of behavioral research analyzed using statisti...
In applied statistics, tools from machine learning are popular for analyzing complex and high-dimens...
Cross-validation, an economical method for assessing whether sample results will generalize, is disc...
We consider comparisons of statistical learning algorithms using multiple data sets, via leave-one-i...
Do scientific claims, based on systematic observations, mean they are compulsorily true? Some empiri...
The veracity of scientific claims is not always certain. In fact, sufficient claims have been proven...
<p>In this toy model, we try predict a variable from an uncorrelated predictor. The predictive power...
The goal of science is to accumulate knowledge that answers questions such as How do things work? ...