High-throughput sequencing is a powerful tool, but it suffers from biases and errors that must be accounted for to prevent false biological conclusions. Such errors include batch effects: technical errors present only in subsets of data because of procedural changes within a study. If batch effects are overlooked and multiple batches of data are combined, spurious biological signals can arise, particularly if batches are correlated with biological variables. Batch effects can be minimised through randomisation of sample groups across batches. However, in long-term or multi-year studies where data are added incrementally, full randomisation is impossible and batch effects may be a common feature. Here we present a case study where false signals of selection were ...
Population differentiation (PD) and ecological association (EA) tests have recently emerged as promi...
It is well known, but frequently overlooked, that low- and high-throughput molecular data may contai...
[Background] Contaminant DNA is a well-known confounding factor in molecular biology and in genomic ...
The increasing access to high-throughput sequencing is certainly one of the ma...
[Background] Large sample sets of whole genome sequencing with deep coverage are being genera...
It is often unavoidable to combine data from different sequencing centers or sequencing platforms wh...
Genetic studies have shifted to sequencing-based rare variants discovery after decades of success in...
[Background] Combining genomic data sets from multiple studies is advantageous to increase st...
Batch effects (BEs) are technical biases that may confound analysis of high-throughput biotechnologi...
The 1000 Genomes Project (1000G) is one of the most popular whole genome sequencing datasets used in...
[Background] The Cancer Genome Atlas (TCGA) is a comprehensive database that includes multi-l...
The Affymetrix GeneChip Human Mapping 500K array is common for genome-wide association studies (GWAS...