This paper presents theoretical results on combining non-probability and probability survey samples through mass imputation, an approach originally proposed by Rivers (2007) as sample matching without rigorous theoretical justification. Under suitable regularity conditions, we establish the consistency of the mass imputation estimator and derive its asymptotic variance formula. Variance estimators are developed using either linearization or bootstrap. Finite sample performances of the mass imputation estimator are investigated through simulation studies and an application to analyzing a non-probability sample collected by the Pew Research Centre
Survey data collection costs have risen to a point where many survey researchers and polling compani...
Rubin’s variance estimator of the multiple imputation estimator for a domain mean is not asymptotica...
International audienceItem non-response in surveys is usually handled by single imputation, whose ma...
Analysis of non-probability survey samples requires auxiliary information at the population level. S...
Multiple data sources are becoming increasingly available for statistical analyses in the era of big...
Predictive mean matching imputation is popular for handling item nonresponse in survey sampling. In ...
Predictive mean matching imputation is popular for handling item nonresponse in survey sampling. In ...
Imputed values in surveys are often generated under the assumption that the sampling mechanism is no...
Finite population inference is a central goal in survey sampling. Probability sampling is the main s...
Peer Reviewedhttps://deepblue.lib.umich.edu/bitstream/2027.42/142493/1/biom12413_am.pdfhttps://deepb...
A well-known problem in the field of survey sampling is the problem of missing data due to a number ...
The properties of the usual estimator of the population mean or total calculated using a sample that...
Missing data are a pervasive problem in large-scale surveys, arising when a sampled unit does not re...
Multiple imputation provides an effective way to handle missing data. When several possible models a...
Nearest neighbor imputation has a long tradition for handling item nonresponse in survey sampling. I...
Survey data collection costs have risen to a point where many survey researchers and polling compani...
Rubin’s variance estimator of the multiple imputation estimator for a domain mean is not asymptotica...
International audienceItem non-response in surveys is usually handled by single imputation, whose ma...
Analysis of non-probability survey samples requires auxiliary information at the population level. S...
Multiple data sources are becoming increasingly available for statistical analyses in the era of big...
Predictive mean matching imputation is popular for handling item nonresponse in survey sampling. In ...
Predictive mean matching imputation is popular for handling item nonresponse in survey sampling. In ...
Imputed values in surveys are often generated under the assumption that the sampling mechanism is no...
Finite population inference is a central goal in survey sampling. Probability sampling is the main s...
Peer Reviewedhttps://deepblue.lib.umich.edu/bitstream/2027.42/142493/1/biom12413_am.pdfhttps://deepb...
A well-known problem in the field of survey sampling is the problem of missing data due to a number ...
The properties of the usual estimator of the population mean or total calculated using a sample that...
Missing data are a pervasive problem in large-scale surveys, arising when a sampled unit does not re...
Multiple imputation provides an effective way to handle missing data. When several possible models a...
Nearest neighbor imputation has a long tradition for handling item nonresponse in survey sampling. I...
Survey data collection costs have risen to a point where many survey researchers and polling compani...
Rubin’s variance estimator of the multiple imputation estimator for a domain mean is not asymptotica...
International audienceItem non-response in surveys is usually handled by single imputation, whose ma...