In many application areas, prediction rules trained based on high-dimensional data are subsequently applied to make predictions for observations from other sources, but they do not always perform well in this setting. This is because data sets from different sources can feature (slightly) differing distributions, even if they come from similar populations. In the context of high-dimensional data and beyond, most prediction methods involve one or several tuning parameters. Their values are commonly chosen by maximizing the cross-validated prediction performance on the training data. This procedure, however, implicitly presumes that the data to which the prediction rule will be ultimately applied, follow the same distribution as the training ...
As DNA microarray data contain relatively small sample size compared to the number of genes, high di...
With advancements in genomic technologies, it is common to have two high-dimensional datasets, each ...
As DNA microarray data contain relatively small sample size compared to the number of genes, high di...
In many application areas, prediction rules trained based on high-dimensional data are subsequently ...
In many application areas, prediction rules trained based on high-dimensional data are subsequently ...
In biomedical research, boosting-based regression approaches have gained much attention in the last ...
High-dimensional data applications often entail the use of various statistical and machine-learning ...
Incorporating measurements on correlated traits into genomic prediction models can increase predicti...
Incorporating measurements on correlated traits into genomic prediction models can increase predicti...
We develop an inference framework for the difference in errors between 2 prediction procedures. The ...
Prediction models often yield inaccurate predictions for new individuals. Large data sets from poole...
Prediction models often yield inaccurate predictions for new individuals. Large data sets from poole...
Comparing the relative fit of competing models can be used to address many different scientific ques...
Many prediction methods have been proposed in the literature, but most of them ignore heterogeneity ...
Recent minimum sample size formula (Riley et al.) for developing clinical prediction models help ens...
As DNA microarray data contain relatively small sample size compared to the number of genes, high di...
With advancements in genomic technologies, it is common to have two high-dimensional datasets, each ...
As DNA microarray data contain relatively small sample size compared to the number of genes, high di...
In many application areas, prediction rules trained based on high-dimensional data are subsequently ...
In many application areas, prediction rules trained based on high-dimensional data are subsequently ...
In biomedical research, boosting-based regression approaches have gained much attention in the last ...
High-dimensional data applications often entail the use of various statistical and machine-learning ...
Incorporating measurements on correlated traits into genomic prediction models can increase predicti...
Incorporating measurements on correlated traits into genomic prediction models can increase predicti...
We develop an inference framework for the difference in errors between 2 prediction procedures. The ...
Prediction models often yield inaccurate predictions for new individuals. Large data sets from poole...
Prediction models often yield inaccurate predictions for new individuals. Large data sets from poole...
Comparing the relative fit of competing models can be used to address many different scientific ques...
Many prediction methods have been proposed in the literature, but most of them ignore heterogeneity ...
Recent minimum sample size formula (Riley et al.) for developing clinical prediction models help ens...
As DNA microarray data contain relatively small sample size compared to the number of genes, high di...
With advancements in genomic technologies, it is common to have two high-dimensional datasets, each ...
As DNA microarray data contain relatively small sample size compared to the number of genes, high di...