*<p><i>n</i><sub>new_veri</sub> denotes the number of verified genes newly added in updated snapshot of SGD database; <i>n</i><sub>predicted</sub> denotes the overall predicted protein-coding ORFs based on every historical version of SGD database. Take the first snapshot as example, we predict 1784 coding ORFs based on the data of 2004, which covered 99.21% of the 127 newly added verified genes in 2005.</p
Nowadays, prokaryotic genomes are sequenced faster than the capacity to manually curate gene annotat...
<div><p>Nowadays, prokaryotic genomes are sequenced faster than the capacity to manually curate gene...
<p>Starting with the <i>D. melanogaster</i> reference genome (release 5.41), the sequence was cut in...
<p>Before re-annotation category include: Known (1854), Hypothetical (781) and Putative (47). After ...
<p><b>a-d)</b> ORF length and coding score for ORFs in different sequence types. <i>De novo</i> gene...
(A) Number of publications per gene for past and recent research. Publications of past research (unt...
(A) Change in the distribution of knownness of the 13,421 clusters that contain at least 1 protein f...
Human gene catalogs are fundamental to the study of human biology and medicine. But they are all bas...
<div><p>Human gene catalogs are fundamental to the study of human biology and medicine. But they are...
Data used in the human de novo ORF paper (Dowling et al. 2020 Genome Biology and Evolution evaa194)....
Summary: Genomes of emerging model organisms are now being sequenced at very low cost. However, obta...
(A) Change in the distribution of knownness of the 7,515 clusters that contain at least 1 protein fr...
Motivation: A central problem in bioinformatics is the assignment of function to sequenced open read...
Motivation: A central problem in bioinformatics is the assignment of function to sequenced open read...
Protein structure predictions based on sequence normally use a reference database to carry out s...
Nowadays, prokaryotic genomes are sequenced faster than the capacity to manually curate gene annotat...
<div><p>Nowadays, prokaryotic genomes are sequenced faster than the capacity to manually curate gene...
<p>Starting with the <i>D. melanogaster</i> reference genome (release 5.41), the sequence was cut in...
<p>Before re-annotation category include: Known (1854), Hypothetical (781) and Putative (47). After ...
<p><b>a-d)</b> ORF length and coding score for ORFs in different sequence types. <i>De novo</i> gene...
(A) Number of publications per gene for past and recent research. Publications of past research (unt...
(A) Change in the distribution of knownness of the 13,421 clusters that contain at least 1 protein f...
Human gene catalogs are fundamental to the study of human biology and medicine. But they are all bas...
<div><p>Human gene catalogs are fundamental to the study of human biology and medicine. But they are...
Data used in the human de novo ORF paper (Dowling et al. 2020 Genome Biology and Evolution evaa194)....
Summary: Genomes of emerging model organisms are now being sequenced at very low cost. However, obta...
(A) Change in the distribution of knownness of the 7,515 clusters that contain at least 1 protein fr...
Motivation: A central problem in bioinformatics is the assignment of function to sequenced open read...
Motivation: A central problem in bioinformatics is the assignment of function to sequenced open read...
Protein structure predictions based on sequence normally use a reference database to carry out s...
Nowadays, prokaryotic genomes are sequenced faster than the capacity to manually curate gene annotat...
<div><p>Nowadays, prokaryotic genomes are sequenced faster than the capacity to manually curate gene...
<p>Starting with the <i>D. melanogaster</i> reference genome (release 5.41), the sequence was cut in...