Using a metabolomics data set with 1057 serum samples, we designed and assessed different procedures based on Monte Carlo resampling schemes to determine the optimal number of components to be included in partial least squares (PLS) regression models. Corresponding estimates of prediction error were calculated and compared in a single algorithm comprising i) a single loop Monte Carlo approach repeatedly and randomly splitting samples into calibration and validation samples, ii) a double loop validation splitting samples into calibration/validation and prediction sets, and, iii) independent sample sets in a third loop. In order to mimic the common situation with only a moderate number of samples available for building the model, only a fract...
Motivation: In genomic studies, thousands of features are collected on relatively few samples. One o...
Data sets with multiple responses and multiple predictor variables are increasingly common. It is kn...
Partial least squares (PLS) is a class of statistical methods for multivariate data analysis. In the...
Using a metabolomics data set with 1057 serum samples, we designed and assessed different procedures...
The selection of the optimal number of components remains a difficult but essential task in partial ...
Partial least squares (PLS) is one of the most commonly used supervised modelling approaches for ana...
The predominance of partial least squares-discriminant analysis (PLS-DA) used to analyze metabolomic...
High-dimensional data applications often entail the use of various statistical and machine-learning ...
Monte Carlo simulation methods was used to study the effects of the data structure on the quality of...
Metabolomic studies with a time-series design are widely used for discovery and validation of biomar...
Purpose The purpose of this paper is to enhance consistent partial least squares (PLSc) to yield con...
Partial Least Squares (PLS) is a popular technique with extensive adoption within the Information Sy...
Purpose The purpose of this paper is to enhance consistent partial least squares (PLSc) to yield con...
Partial Least Squares-Discriminant Analysis (PLS-DA) is a PLS regression method with a special binar...
Statistical model validation tools such as cross-validation, jack-knifing model parameters and permu...
Motivation: In genomic studies, thousands of features are collected on relatively few samples. One o...
Data sets with multiple responses and multiple predictor variables are increasingly common. It is kn...
Partial least squares (PLS) is a class of statistical methods for multivariate data analysis. In the...
Using a metabolomics data set with 1057 serum samples, we designed and assessed different procedures...
The selection of the optimal number of components remains a difficult but essential task in partial ...
Partial least squares (PLS) is one of the most commonly used supervised modelling approaches for ana...
The predominance of partial least squares-discriminant analysis (PLS-DA) used to analyze metabolomic...
High-dimensional data applications often entail the use of various statistical and machine-learning ...
Monte Carlo simulation methods was used to study the effects of the data structure on the quality of...
Metabolomic studies with a time-series design are widely used for discovery and validation of biomar...
Purpose The purpose of this paper is to enhance consistent partial least squares (PLSc) to yield con...
Partial Least Squares (PLS) is a popular technique with extensive adoption within the Information Sy...
Purpose The purpose of this paper is to enhance consistent partial least squares (PLSc) to yield con...
Partial Least Squares-Discriminant Analysis (PLS-DA) is a PLS regression method with a special binar...
Statistical model validation tools such as cross-validation, jack-knifing model parameters and permu...
Motivation: In genomic studies, thousands of features are collected on relatively few samples. One o...
Data sets with multiple responses and multiple predictor variables are increasingly common. It is kn...
Partial least squares (PLS) is a class of statistical methods for multivariate data analysis. In the...