Background: In silco Biology is increasingly important and is often based on public data. While the problem of contamination is well recognised in microbiology labs the corresponding problem of database corruption has received less attention. Results: Mapping 50 billion next generation DNA sequences from The Thousand Genome Project against published genomes reveals many that match one or more Mycoplasma but are not included in the reference human genome GRCh37.p5. Many of these are of low quality but NCBI BLAST searches confirm some high quality, high entropy sequences match Mycoplasma but no human sequences. Conclusions: It appears at least 7 % of 1000G samples are contaminated
Background: A variety of bacteria are known to influence carcinogenesis. Therefore, we sought to inv...
Trace quantities of contaminating DNA are widespread in the laboratory environment, but their presen...
The importance of using curated microbial reference genome databases. <br> Classi...
Background: In silco Biology is increasingly important and is often based on public data. While the ...
Contamination in genome assembly can lead to wrong or confusing results when using such genome as re...
Unbiased high-throughput sequencing of whole metagenome shotgun DNA libraries is a promising new app...
Unbiased high-throughput sequencing of whole metagenome shotgun DNA libraries is a promising new app...
<div><p>Unbiased high-throughput sequencing of whole metagenome shotgun DNA libraries is a promising...
[Background] Contaminant DNA is a well-known confounding factor in molecular biology and in genomic ...
During routine screens of the NCBI databases using human repetitive elements we discovered an unlike...
Contaminating sequences in public genome databases is a pervasive issue with potentially far-reachin...
Identifying causative disease agents in human patients from shotgun metagenomic sequencing (SMS) pre...
Metagenomic sequencing of patient samples is a very promising method for the diagnosis of human infe...
BACKGROUND: Contaminant DNA is a well-known confounding factor in molecular biology and in genomic r...
peer reviewedThe decreasing cost of sequencing and concomitant augmentation of publicly available ge...
Background: A variety of bacteria are known to influence carcinogenesis. Therefore, we sought to inv...
Trace quantities of contaminating DNA are widespread in the laboratory environment, but their presen...
The importance of using curated microbial reference genome databases. <br> Classi...
Background: In silco Biology is increasingly important and is often based on public data. While the ...
Contamination in genome assembly can lead to wrong or confusing results when using such genome as re...
Unbiased high-throughput sequencing of whole metagenome shotgun DNA libraries is a promising new app...
Unbiased high-throughput sequencing of whole metagenome shotgun DNA libraries is a promising new app...
<div><p>Unbiased high-throughput sequencing of whole metagenome shotgun DNA libraries is a promising...
[Background] Contaminant DNA is a well-known confounding factor in molecular biology and in genomic ...
During routine screens of the NCBI databases using human repetitive elements we discovered an unlike...
Contaminating sequences in public genome databases is a pervasive issue with potentially far-reachin...
Identifying causative disease agents in human patients from shotgun metagenomic sequencing (SMS) pre...
Metagenomic sequencing of patient samples is a very promising method for the diagnosis of human infe...
BACKGROUND: Contaminant DNA is a well-known confounding factor in molecular biology and in genomic r...
peer reviewedThe decreasing cost of sequencing and concomitant augmentation of publicly available ge...
Background: A variety of bacteria are known to influence carcinogenesis. Therefore, we sought to inv...
Trace quantities of contaminating DNA are widespread in the laboratory environment, but their presen...
The importance of using curated microbial reference genome databases. <br> Classi...