DNA sample contamination is a frequent problem in DNA sequencing studies and can result in genotyping errors and reduced power for association testing. We recently described methods to identify within-species DNA sample contamination based on sequencing read data, showed that our methods can reliably detect and estimate contamination levels as low as 1%, and suggested strategies to identify and remove contaminated samples from sequencing studies. Here we propose methods to model contamination during genotype calling as an alternative to removal of contaminated samples from further analyses. We compare our contamination-adjusted calls to calls that ignore contamination and to calls based on uncontaminated data. We demonstrate that, for moder...
Abstract Background Researchers typically sequence a given individual multiple times, either re-sequ...
Metagenomic next-generation sequencing (mNGS) experiments involving small amounts of nucleic acid in...
Contaminating sequences in public genome databases is a pervasive issue with potentially far-reachin...
DNA sample contamination is a frequent problem in DNA sequencing studies and can result in genotypin...
Array genotyping is a cost‐effective and widely used tool that enables assessment of up to millions ...
BACKGROUND: Contaminant DNA is a well-known confounding factor in molecular biology and in genomic r...
[Background] Contaminant DNA is a well-known confounding factor in molecular biology and in genomic ...
Trace quantities of contaminating DNA are widespread in the laboratory environment, but their presen...
<div><p>Trace quantities of contaminating DNA are widespread in the laboratory environment, but thei...
peer reviewedThe decreasing cost of sequencing and concomitant augmentation of publicly available ge...
Polymorphism discovery is a routine application of next-generation sequencing technology where multi...
In recent years, the high throughput and the low cost of next-generation sequencing (NGS) technologi...
The rapidly increasing throughput of sequencing technologies allows us to sequence genomes, transcri...
Contaminations frequently occur in bacterial cultures, which significantly affect the reproducibilit...
Abstract Background Large sample sets of whole genome sequencing with deep coverage are being genera...
Abstract Background Researchers typically sequence a given individual multiple times, either re-sequ...
Metagenomic next-generation sequencing (mNGS) experiments involving small amounts of nucleic acid in...
Contaminating sequences in public genome databases is a pervasive issue with potentially far-reachin...
DNA sample contamination is a frequent problem in DNA sequencing studies and can result in genotypin...
Array genotyping is a cost‐effective and widely used tool that enables assessment of up to millions ...
BACKGROUND: Contaminant DNA is a well-known confounding factor in molecular biology and in genomic r...
[Background] Contaminant DNA is a well-known confounding factor in molecular biology and in genomic ...
Trace quantities of contaminating DNA are widespread in the laboratory environment, but their presen...
<div><p>Trace quantities of contaminating DNA are widespread in the laboratory environment, but thei...
peer reviewedThe decreasing cost of sequencing and concomitant augmentation of publicly available ge...
Polymorphism discovery is a routine application of next-generation sequencing technology where multi...
In recent years, the high throughput and the low cost of next-generation sequencing (NGS) technologi...
The rapidly increasing throughput of sequencing technologies allows us to sequence genomes, transcri...
Contaminations frequently occur in bacterial cultures, which significantly affect the reproducibilit...
Abstract Background Large sample sets of whole genome sequencing with deep coverage are being genera...
Abstract Background Researchers typically sequence a given individual multiple times, either re-sequ...
Metagenomic next-generation sequencing (mNGS) experiments involving small amounts of nucleic acid in...
Contaminating sequences in public genome databases is a pervasive issue with potentially far-reachin...