Contamination in genome assembly can lead to wrong or confusing results when using such genome as reference in sequence comparison. Although bacterial contamination is well known, the problem of human-originated contamination received little attention. In this study we surveyed 45,735 available genome assemblies for evidence of human contamination. We used lineage specificity to distinguish between contamination and conservation. We found that 154 genome assemblies contain fragments that with high confidence originate as contamination from human DNA. Majority of contaminating human sequences were present in the reference human genome assembly for over a decade. We recommend that existing contaminated genomes should be revised to remove cont...
High-throughput sequencing technologies have strongly impacted microbiology, providing a rapid and c...
[Background] Contaminant DNA is a well-known confounding factor in molecular biology and in genomic ...
High-throughput sequencing provides a fast and cost-effective mean to recover genomes of organisms f...
During routine screens of the NCBI databases using human repetitive elements we discovered an unlike...
In recent years, the high throughput and the low cost of next-generation sequencing (NGS) technologi...
Thanks to huge advances in sequencing technologies, genomic resources are increasingly being generat...
peer reviewedThe decreasing cost of sequencing and concomitant augmentation of publicly available ge...
Contaminating sequences in public genome databases is a pervasive issue with potentially far-reachin...
Trace quantities of contaminating DNA are widespread in the laboratory environment, but their presen...
<div><p>Trace quantities of contaminating DNA are widespread in the laboratory environment, but thei...
BACKGROUND: Contaminant DNA is a well-known confounding factor in molecular biology and in genomic r...
The raw data from a genome sequencing project sometimes contains DNA from contaminating organisms, w...
Background: In silco Biology is increasingly important and is often based on public data. While the ...
Background: In silco Biology is increasingly important and is often based on public data. While the ...
<div><p>The high level of accuracy and sensitivity of next generation sequencing for quantifying gen...
High-throughput sequencing technologies have strongly impacted microbiology, providing a rapid and c...
[Background] Contaminant DNA is a well-known confounding factor in molecular biology and in genomic ...
High-throughput sequencing provides a fast and cost-effective mean to recover genomes of organisms f...
During routine screens of the NCBI databases using human repetitive elements we discovered an unlike...
In recent years, the high throughput and the low cost of next-generation sequencing (NGS) technologi...
Thanks to huge advances in sequencing technologies, genomic resources are increasingly being generat...
peer reviewedThe decreasing cost of sequencing and concomitant augmentation of publicly available ge...
Contaminating sequences in public genome databases is a pervasive issue with potentially far-reachin...
Trace quantities of contaminating DNA are widespread in the laboratory environment, but their presen...
<div><p>Trace quantities of contaminating DNA are widespread in the laboratory environment, but thei...
BACKGROUND: Contaminant DNA is a well-known confounding factor in molecular biology and in genomic r...
The raw data from a genome sequencing project sometimes contains DNA from contaminating organisms, w...
Background: In silco Biology is increasingly important and is often based on public data. While the ...
Background: In silco Biology is increasingly important and is often based on public data. While the ...
<div><p>The high level of accuracy and sensitivity of next generation sequencing for quantifying gen...
High-throughput sequencing technologies have strongly impacted microbiology, providing a rapid and c...
[Background] Contaminant DNA is a well-known confounding factor in molecular biology and in genomic ...
High-throughput sequencing provides a fast and cost-effective mean to recover genomes of organisms f...