<p>The proportion of sequences with matches in NCBI nr databases is greater among the longer assembled sequences (with a cut-off E-value of 10<sup>−5</sup>).</p>
Duplicate sequence records - that is, records having similar or identical sequences - are a challeng...
Motivation: The number of Single Nucleotide Polymorphisms (SNPs) detectable in an alignment is a fun...
Word matches are widely used to compare DNA sequences, especially when the compared sequences are to...
<p>The proportion of sequences with matches (with a cut-off E-value of 1.0E-5) in nr database is gre...
<p>(A) Singleton sequences. (B) Cluster sequences. The proportion of sequences with matches (with a ...
<p>The proportion of sequences with matches (e-value cut off 1e-05) in NCBI embryophytes nr database...
<p>Effect of query unigene length on the numbers of matched unigenes and the percentage of unigenes ...
<p>The x-axis indicates sequence sizes from 200 nt to ≥3000 nt. The y-axis indicates the number of un...
MOTIVATION: Database search programs such as FASTA, BLAST or a rigorous Smith-Waterman algorithm pr...
<p>The numbers of unigenes matched (with a cut-off E-value of 10<sup>-5</sup>) in NCBI nr databases ...
<p>(A) The counts of assembled genes and isoforms decrease with length. The length of genes a...
*<p>Query coverage is the percentage of the query length that is included in the aligned segments.</p>*<...
Lowest percentage of sequence identity below which the comparison against the NCBI nt database did n...
<p>Comparison of the number of sequences annotated by SAP with direct BLAST against the NCBI-nr data...
<p>The upper shows the number of sequences aligned to the clementine database; the lower shows the aligned seque...