<p>(<b>a</b>) The probability function of predicting the copy number of a given KO in a given dataset across all simulated 101-bp datasets using the <i>top gene</i> protocol and when the strain from which the reads originated is absent from the database. Only KOs with copy numbers 1 to 4 are illustrated. The curve corresponding to copy number 0 represents false positive KO predictions. The smaller peaks showing in some curves (e.g., the two extra peaks in the blue “1 copy” curve) were found to be due to stretches of intergenic reads that mismapped to KO genes in the database and likely reflect genomic misannotations or pseudogenes. (<b>b</b>) The average recall across all simulated 101-bp datasets for identifying reads originating from each...
<p><b>Copyright information:</b></p><p>Taken from "Identification of homologs in insignificant blast...
<p>(<b>A</b>) Three major clusters. The right-hand annotation indicates, in order, the BRAFm (in yel...
<p>For each year from 2005 to 2013 denoted on the x-axis, the corresponding dataset includes those g...
<p>The (<b>a</b>) precision and (<b>b</b>) recall are illustrated for several protocols for identify...
<p>The phylogenetic tree was obtained from Ref. <a href="http://www.plosone.org/article/info:doi/10....
<p>(<b>a</b>) Simulated sets of reads of length L are generated from curated and annotated reference...
<p>The estimate consists on false negatives (a paralog conserved next to a RBH) divided by the sum o...
<p>Representative sequences from all the 8,099 HGCs were subjected to annotation out of which 47.77%...
The statistical estimates of BLAST and PSI-BLAST are of extreme importance to determine the biologi...
<p><b>Copyright information:</b></p><p>Taken from "Identification of homologs in insignificant blast...
<p>(A) Blast2Go functional annotation results overview. No Blast, Number of sequences without blast ...
High copy number TE families and characteristics of burst initiation: Comparison of copies in bursts...
<p>Note: – represents no hits. Sesame repeat sequences were used as the query. The number in the tab...
International audienceUltra high-throughput sequencing is used to analyse the transcriptome or inter...
The problem of functional annotation of novel sequences has been a sigfinicant issue for many labora...
<p><b>Copyright information:</b></p><p>Taken from "Identification of homologs in insignificant blast...
<p>(<b>A</b>) Three major clusters. The right-hand annotation indicates, in order, the BRAFm (in yel...
<p>For each year from 2005 to 2013 denoted on the x-axis, the corresponding dataset includes those g...
<p>The (<b>a</b>) precision and (<b>b</b>) recall are illustrated for several protocols for identify...
<p>The phylogenetic tree was obtained from Ref. <a href="http://www.plosone.org/article/info:doi/10....
<p>(<b>a</b>) Simulated sets of reads of length L are generated from curated and annotated reference...
<p>The estimate consists on false negatives (a paralog conserved next to a RBH) divided by the sum o...
<p>Representative sequences from all the 8,099 HGCs were subjected to annotation out of which 47.77%...
The statistical estimates of BLAST and PSI-BLAST are of extreme importance to determine the biologi...
<p><b>Copyright information:</b></p><p>Taken from "Identification of homologs in insignificant blast...
<p>(A) Blast2Go functional annotation results overview. No Blast, Number of sequences without blast ...
High copy number TE families and characteristics of burst initiation: Comparison of copies in bursts...
<p>Note: – represents no hits. Sesame repeat sequences were used as the query. The number in the tab...
International audienceUltra high-throughput sequencing is used to analyse the transcriptome or inter...
The problem of functional annotation of novel sequences has been a sigfinicant issue for many labora...
<p><b>Copyright information:</b></p><p>Taken from "Identification of homologs in insignificant blast...
<p>(<b>A</b>) Three major clusters. The right-hand annotation indicates, in order, the BRAFm (in yel...
<p>For each year from 2005 to 2013 denoted on the x-axis, the corresponding dataset includes those g...