Batch effects are due to probe-specific systematic variation between groups of samples (batches) resulting from experimental features that are not of biological interest. Principal components analysis (PCA) is commonly used as a visual tool to determine whether batch effects exist after applying a global normalization method. However, PCA yields linear combinations of the variables that contribute maximum variance and thus will not necessarily detect batch effects if they are not the largest source of variability in the data. We present an extension of principal components analysis to quantify the existence of batch effects, called guided PCA (gPCA). We describe a test statistic that uses gPCA to test if a batch effect exists. We apply our ...
It is now known that unwanted noise and unmodeled artifacts such as batch effects can dramatically r...
High-throughput sequencing is a powerful tool, but suffers biases and errors that must be accounted ...
High-throughput sequencing is a powerful tool, but suffers biases and errors that must be accounted ...
Motivation: Batch e↵ects are due to probe-specific systematic variation between groups of samples (b...
Genome projects now generate large-scale data often produced at various time points by different lab...
Abstract Background Combining genomic data sets from multiple studies is advantageous to increase st...
A new statistic for identifying batch effects in high-throughput genomic data that uses guided princ...
High-dimensional genomic data can provide deep insight into biological processes. However, conventio...
Abstract Background Large sample sets of whole genome sequencing with deep coverage are being genera...
Batch effects are technical sources of variation and can confound analysis. While many performance r...
High-throughput technologies are widely used in a variety of biomedical research fields to enable ra...
Principal Component Analysis (PCA) is a multivariate analysis that reduces the complexity of dataset...
It is now known that unwanted noise and unmodeled artifacts such as batch effects can dramatically r...
It is common and advantageous for researchers to combine RNA-seq data from similar studies to increa...
International audienceAbstract Biological, technical, and environmental confounders are ubiquitous i...
It is now known that unwanted noise and unmodeled artifacts such as batch effects can dramatically r...
High-throughput sequencing is a powerful tool, but suffers biases and errors that must be accounted ...
High-throughput sequencing is a powerful tool, but suffers biases and errors that must be accounted ...
Motivation: Batch e↵ects are due to probe-specific systematic variation between groups of samples (b...
Genome projects now generate large-scale data often produced at various time points by different lab...
Abstract Background Combining genomic data sets from multiple studies is advantageous to increase st...
A new statistic for identifying batch effects in high-throughput genomic data that uses guided princ...
High-dimensional genomic data can provide deep insight into biological processes. However, conventio...
Abstract Background Large sample sets of whole genome sequencing with deep coverage are being genera...
Batch effects are technical sources of variation and can confound analysis. While many performance r...
High-throughput technologies are widely used in a variety of biomedical research fields to enable ra...
Principal Component Analysis (PCA) is a multivariate analysis that reduces the complexity of dataset...
It is now known that unwanted noise and unmodeled artifacts such as batch effects can dramatically r...
It is common and advantageous for researchers to combine RNA-seq data from similar studies to increa...
International audienceAbstract Biological, technical, and environmental confounders are ubiquitous i...
It is now known that unwanted noise and unmodeled artifacts such as batch effects can dramatically r...
High-throughput sequencing is a powerful tool, but suffers biases and errors that must be accounted ...
High-throughput sequencing is a powerful tool, but suffers biases and errors that must be accounted ...