Cross-validation (CV) methods are popular for selecting the tuning parameter in the high-dimensional variable selection problem. We show the misalignment of the CV is one possible reason of its over-selection behavior. To fix this issue, we propose a version of leave-nv-out cross-validation (CV(nv)), for selecting the optimal model among the restricted candidate model set for high-dimensional generalized linear models. By using the same candidate model sequence and a proper order of construction sample size nc in each CV split, CV(nv) avoids the potential hurdles in developing theoretical properties. CV(nv) is shown to enjoy the restricted model selection consistency property under mild conditions. Extensive simulations and real data analys...
High-dimensional data are widely encountered in a great variety of areas such as bioinformatics, med...
We consider variable selection in the single-index model. We prove that the popular leave-m-out cros...
Longitudinal models are commonly used for studying data collected on individuals repeatedly through ...
Asymptotic behavior of the tuning parameter selection in the standard cross-validation methods is in...
We consider the problem of model (or variable) selection in the classical regression model based on ...
Forward Selection (FS) is a popular variable selection method for linear regression. Working in a sp...
Forward Selection (FS) is a popular variable selection method for linear regression. Working in a sp...
This paper investigates two types of results that support the use of Generalized Cross Validation (G...
that leave-one-out cross-validation is not subject to the “no-free-lunch ” criticism. Despite this o...
This article gives a robust technique for model selection in regression models, an important aspect ...
Variable selection criteria in a linear regression model are analyzed, considering consistency and e...
Motivation: Validation of variable selection and predictive performance is crucial in construction o...
Generalized linear models are popular for modelling a large variety of data. We consider variable se...
A natural technique to select variables in the context of generalized linear models is to use a step...
Model selection is difficult to analyse yet theoretically and empirically important, especially for ...
High-dimensional data are widely encountered in a great variety of areas such as bioinformatics, med...
We consider variable selection in the single-index model. We prove that the popular leave-m-out cros...
Longitudinal models are commonly used for studying data collected on individuals repeatedly through ...
Asymptotic behavior of the tuning parameter selection in the standard cross-validation methods is in...
We consider the problem of model (or variable) selection in the classical regression model based on ...
Forward Selection (FS) is a popular variable selection method for linear regression. Working in a sp...
Forward Selection (FS) is a popular variable selection method for linear regression. Working in a sp...
This paper investigates two types of results that support the use of Generalized Cross Validation (G...
that leave-one-out cross-validation is not subject to the “no-free-lunch ” criticism. Despite this o...
This article gives a robust technique for model selection in regression models, an important aspect ...
Variable selection criteria in a linear regression model are analyzed, considering consistency and e...
Motivation: Validation of variable selection and predictive performance is crucial in construction o...
Generalized linear models are popular for modelling a large variety of data. We consider variable se...
A natural technique to select variables in the context of generalized linear models is to use a step...
Model selection is difficult to analyse yet theoretically and empirically important, especially for ...
High-dimensional data are widely encountered in a great variety of areas such as bioinformatics, med...
We consider variable selection in the single-index model. We prove that the popular leave-m-out cros...
Longitudinal models are commonly used for studying data collected on individuals repeatedly through ...