Subsampling is a computationally effective approach to extract information from massive data sets when computing resources are limited. After a subsample is taken from the full data, most available methods use an inverse probability weighted (IPW) objective function to estimate the model parameters. The IPW estimator does not fully utilize the information in the selected subsample. In this paper, we propose to use the maximum sampled conditional likelihood estimator (MSCLE) based on the sampled data. We established the asymptotic normality of the MSCLE and prove that its asymptotic variance covariance matrix is the smallest among a class of asymptotically unbiased estimators, including the IPW estimator. We further discuss the asymptotic re...
Data subsampling has become widely recognized as a tool to overcome computational and economic bottl...
We outline how modern likelihood theory, which provides essentially exact inferences in a variety of...
The fate of scientific hypotheses often relies on the ability of a computational model to explain th...
To tackle massive data, subsampling is a practical approach to select the more informative data poin...
The optimal subsampling is an statistical methodology for generalized linear models (GLMs) to make i...
We propose Subsampling MCMC, a Markov Chain Monte Carlo (MCMC) framework where the likelihood functi...
<p>We propose subsampling Markov chain Monte Carlo (MCMC), an MCMC framework where the likelihood fu...
International audienceInference for the parametric distribution of a response given covariates is co...
Big data is ubiquitous in practices, and it has also led to heavy computation burden. To reduce the ...
Running machine learning algorithms on large and rapidly growing volumes of data is often computatio...
We address the problem of fast estimation of ordinary least squares (OLS) from large amounts of data...
We address the problem of fast estimation of ordinary least squares (OLS) from large amounts of data...
Indiana University-Purdue University Indianapolis (IUPUI)A significant hurdle for analyzing big data...
The bootstrap is a widely used procedure for statistical inference because of its simplicity and att...
The bootstrap is a widely used procedure for statistical inference because of its simplicity and att...
Data subsampling has become widely recognized as a tool to overcome computational and economic bottl...
We outline how modern likelihood theory, which provides essentially exact inferences in a variety of...
The fate of scientific hypotheses often relies on the ability of a computational model to explain th...
To tackle massive data, subsampling is a practical approach to select the more informative data poin...
The optimal subsampling is an statistical methodology for generalized linear models (GLMs) to make i...
We propose Subsampling MCMC, a Markov Chain Monte Carlo (MCMC) framework where the likelihood functi...
<p>We propose subsampling Markov chain Monte Carlo (MCMC), an MCMC framework where the likelihood fu...
International audienceInference for the parametric distribution of a response given covariates is co...
Big data is ubiquitous in practices, and it has also led to heavy computation burden. To reduce the ...
Running machine learning algorithms on large and rapidly growing volumes of data is often computatio...
We address the problem of fast estimation of ordinary least squares (OLS) from large amounts of data...
We address the problem of fast estimation of ordinary least squares (OLS) from large amounts of data...
Indiana University-Purdue University Indianapolis (IUPUI)A significant hurdle for analyzing big data...
The bootstrap is a widely used procedure for statistical inference because of its simplicity and att...
The bootstrap is a widely used procedure for statistical inference because of its simplicity and att...
Data subsampling has become widely recognized as a tool to overcome computational and economic bottl...
We outline how modern likelihood theory, which provides essentially exact inferences in a variety of...
The fate of scientific hypotheses often relies on the ability of a computational model to explain th...