<p>For each bacterial genome in a set of 747 genomes, we simulated several read lengths (50 nt, 75 nt, 100 nt, 150 nt, 200 nt, 250 nt) and several substitution error rates (0%, 1%, 5%, 10%). Independent samples of 5, 10, 25, 50, 100, 200, or 300 random reads were used in each query and the distribution of the rank of the correct references in the list recorded; a rank of means that the correct reference was at the very top of the list. The list of hits has a maximum length of 25 and we count the reference as ‘not found’ if it not present in the list. The percentages of correct test bacterial genomes found in that list are represented in a bar plot nested on the right side of each panel. Increasing the number of reads in the random sample b...
An analytical model based on the statistical properties of Open Reading Frames (ORFs) of eubacterial...
Mapping short reads against a reference genome is classically the first step of many next-generation...
International audienceUltra high-throughput sequencing is used to analyse the transcriptome or inter...
<p>For each bacterial genome in a set of 747 genomes, we simulated several read lengths (50 nt, 75 n...
<p>Percentage of matches giving the correct specie, that is a reference in our collection that belon...
<p>Average rank (, x-axis) and standard deviation of the rank (, y-axis) of the correct reference wh...
<p>(A) Simulated metagenomes (50–500 bp; 1% error rate; mock community 160319967-stool1) were search...
<p>Mean number of reads (standard error of the mean) are plotted for the 28 different genes from the...
<p>Reads of various length (70–3,000 bp) were simulated with 1% error rate from mock community 16031...
<p>The number of the sequence reads mapped to the reference genome (<i>Cryptococcus neoformans</i> v...
(a, b) Count percentage of (a) reads and of (b) bases as a function of read length obtained from gen...
An analytical model based on the statistical properties of Open Reading Frames (ORFs) of eubacterial...
<div><p>An analytical model based on the statistical properties of Open Reading Frames (ORFs) of eub...
<p>Conversion of data to relative percent of population based on the number of 16s rRNA operons in e...
Recently Whole Genome Sequencing (WGS) has become the new high-resolution tool used to trace the sou...
An analytical model based on the statistical properties of Open Reading Frames (ORFs) of eubacterial...
Mapping short reads against a reference genome is classically the first step of many next-generation...
International audienceUltra high-throughput sequencing is used to analyse the transcriptome or inter...
<p>For each bacterial genome in a set of 747 genomes, we simulated several read lengths (50 nt, 75 n...
<p>Percentage of matches giving the correct specie, that is a reference in our collection that belon...
<p>Average rank (, x-axis) and standard deviation of the rank (, y-axis) of the correct reference wh...
<p>(A) Simulated metagenomes (50–500 bp; 1% error rate; mock community 160319967-stool1) were search...
<p>Mean number of reads (standard error of the mean) are plotted for the 28 different genes from the...
<p>Reads of various length (70–3,000 bp) were simulated with 1% error rate from mock community 16031...
<p>The number of the sequence reads mapped to the reference genome (<i>Cryptococcus neoformans</i> v...
(a, b) Count percentage of (a) reads and of (b) bases as a function of read length obtained from gen...
An analytical model based on the statistical properties of Open Reading Frames (ORFs) of eubacterial...
<div><p>An analytical model based on the statistical properties of Open Reading Frames (ORFs) of eub...
<p>Conversion of data to relative percent of population based on the number of 16s rRNA operons in e...
Recently Whole Genome Sequencing (WGS) has become the new high-resolution tool used to trace the sou...
An analytical model based on the statistical properties of Open Reading Frames (ORFs) of eubacterial...
Mapping short reads against a reference genome is classically the first step of many next-generation...
International audienceUltra high-throughput sequencing is used to analyse the transcriptome or inter...