The dramatic reduction in the cost of sequencing has allowed many researchers to join in the effort of sequencing and annotating prokaryotic genomes. Annotation methods vary considerably and may fail to identify some genes. Here we draw attention to a large number of likely genes missing from annotations using common tools such as Glimmer and BLAST. By analyzing 1,474 prokaryotic genome annotations in GenBank, we identify 13,602 likely missed genes that are homologs to non-hypothetical proteins, and 11,792 likely missed genes that are homologs only to hypothetical proteins, yet have supporting evidence of their protein-coding nature from COMBREX, a newly created gene function database. We also estimate the likelihood that each potential mis...
The sophistication of gene prediction algorithms and the abundance of RNA-based evidence for the mai...
Background Complete and accurate annotation of sequenced genomes is of paramount importance to their...
Abstract Background Complete and accurate annotation of sequenced genomes is of paramount importance...
COMBREX (http://combrex.bu.edu) is a project to increase the speed of the functional annotation of n...
Abstract Background Experimental verification of gene products has not kept pace with the rapid grow...
Recently, the numbers of prokaryotic genomes are completely sequenced and now more projects are ongo...
With the rapid increase in the number of sequenced prokaryotic genomes, relying on automated gene an...
Large regions of prokaryotic genomes are currently without any annotation, in part due to well-estab...
The annotation of most genomes becomes outdated over time, owing in part to our ever-improving knowl...
Bacterial genome annotations are accumulating rapidly in the GenBank database and the use of automat...
Genome sequences are annotated by computational prediction of coding sequences, followed by similari...
These days, more and more scientists are diving into genome sequencing projects, urged by fast and c...
Background: Annotation of eukaryotic genomes is a complex endeavor that requires the integration of...
BACKGROUND: Reconstruction of biological pathways is typically done through mapping well-characteriz...
The number of newly available viral genomes and metagenomes has increased exponentially since the de...
The sophistication of gene prediction algorithms and the abundance of RNA-based evidence for the mai...
Background Complete and accurate annotation of sequenced genomes is of paramount importance to their...
Abstract Background Complete and accurate annotation of sequenced genomes is of paramount importance...
COMBREX (http://combrex.bu.edu) is a project to increase the speed of the functional annotation of n...
Abstract Background Experimental verification of gene products has not kept pace with the rapid grow...
Recently, the numbers of prokaryotic genomes are completely sequenced and now more projects are ongo...
With the rapid increase in the number of sequenced prokaryotic genomes, relying on automated gene an...
Large regions of prokaryotic genomes are currently without any annotation, in part due to well-estab...
The annotation of most genomes becomes outdated over time, owing in part to our ever-improving knowl...
Bacterial genome annotations are accumulating rapidly in the GenBank database and the use of automat...
Genome sequences are annotated by computational prediction of coding sequences, followed by similari...
These days, more and more scientists are diving into genome sequencing projects, urged by fast and c...
Background: Annotation of eukaryotic genomes is a complex endeavor that requires the integration of...
BACKGROUND: Reconstruction of biological pathways is typically done through mapping well-characteriz...
The number of newly available viral genomes and metagenomes has increased exponentially since the de...
The sophistication of gene prediction algorithms and the abundance of RNA-based evidence for the mai...
Background Complete and accurate annotation of sequenced genomes is of paramount importance to their...
Abstract Background Complete and accurate annotation of sequenced genomes is of paramount importance...