<p>Most variable selection techniques for high-dimensional models are designed to be used in settings, where observations are independent and completely observed. At the same time, there is a rich literature on approaches to estimation of low-dimensional parameters in the presence of correlation, missingness, measurement error, selection bias, and other characteristics of real data. In this article, we present ThrEEBoost (<i>Thr</i>esholded <i>EEBoost</i>), a general-purpose variable selection technique which can accommodate such problem characteristics by replacing the gradient of the loss by an estimating function. ThrEEBoost generalizes the previously proposed EEBoost algorithm (Wolfson <a href="#cit0026" target="_blank">2011</a>) by all...
Preliminary version Several algorithms for indicator saturation are compared and found to have low p...
In biomedical research, boosting-based regression approaches have gained much attention in the last ...
Abstract: High dimensional data are nowadays encountered in various branches of science. Variable se...
University of Minnesota Ph.D. dissertation. July 2017. Major: Biostatistics. Advisors: Julian Wolfso...
We present a statistical perspective on boosting. Special emphasis is given to estimating potentiall...
We present a new variable selection method based on model-based gradient boosting and randomly permu...
Variable selection and estimation for high-dimensional data have become a topic of foremost importan...
Abstract High-dimensional prediction typically comprises vari-able selection followed by least-squar...
Given n noisy samples with p dimensions, where n ≪ p, we show that the multi-step thresholding proce...
With respect to variable selection in linear regression, partial correlation for normal models (Buhl...
Motivation: With the growth of big data, variable selection has become one of the critical challenge...
We review variable selection and variable screening in high-dimensional linear models. Thereby, a ma...
Background Modern biotechnologies often result in high-dimensional data sets with many more varia...
From the perspective of econometrics, an accurate variable selection method greatly enhances the rel...
Boosting is one of the most important methods for fitting regression models and building prediction...
Preliminary version Several algorithms for indicator saturation are compared and found to have low p...
In biomedical research, boosting-based regression approaches have gained much attention in the last ...
Abstract: High dimensional data are nowadays encountered in various branches of science. Variable se...
University of Minnesota Ph.D. dissertation. July 2017. Major: Biostatistics. Advisors: Julian Wolfso...
We present a statistical perspective on boosting. Special emphasis is given to estimating potentiall...
We present a new variable selection method based on model-based gradient boosting and randomly permu...
Variable selection and estimation for high-dimensional data have become a topic of foremost importan...
Abstract High-dimensional prediction typically comprises vari-able selection followed by least-squar...
Given n noisy samples with p dimensions, where n ≪ p, we show that the multi-step thresholding proce...
With respect to variable selection in linear regression, partial correlation for normal models (Buhl...
Motivation: With the growth of big data, variable selection has become one of the critical challenge...
We review variable selection and variable screening in high-dimensional linear models. Thereby, a ma...
Background Modern biotechnologies often result in high-dimensional data sets with many more varia...
From the perspective of econometrics, an accurate variable selection method greatly enhances the rel...
Boosting is one of the most important methods for fitting regression models and building prediction...
Preliminary version Several algorithms for indicator saturation are compared and found to have low p...
In biomedical research, boosting-based regression approaches have gained much attention in the last ...
Abstract: High dimensional data are nowadays encountered in various branches of science. Variable se...