Recent studies have noted extensive inconsistencies in gene start sites among orthologous genes in related microbial genomes. Here we provide the first documented evidence that imposing gene start consistency improves the accuracy of gene start-site prediction. We applied an algorithm using a genome majority vote (GMV) scheme to increase the consistency of gene starts among orthologs. We used a set of validated Escherichia coli genes as a standard to quantify accuracy. Results showed that the GMV algorithm can correct hundreds of gene prediction errors in sets of five or ten genomes while introducing few errors. Using a conservative calculation, we project that GMV would resolve many inconsistencies and errors in publicly available microbia...
Consistency modeling for gene selection is a new topic emerging from recent cancer bioinformatics re...
Predicting protein-coding genes still remains a significant challenge. Although a variety of computa...
Automatic gene prediction is one of the major challenges in computational sequence analysis. Traditi...
Michael E. Wall is with Los Alamos National Laboratory, Sindhu Raghavan is with UT Austin and Los Al...
Abstract Background Evolutionary divergence in the position of the translational start site among or...
<div><p>Nowadays, prokaryotic genomes are sequenced faster than the capacity to manually curate gene...
Nowadays, prokaryotic genomes are sequenced faster than the capacity to manually curate gene annotat...
This paper is supposed to bridge the gap between practical experience in using GeneMark for a rapidl...
Nowadays, prokaryotic genomes are sequenced faster than the capacity to manually curate gene annotat...
Modern gene location prediction techniques are able to achieve near-perfect accuracy for prokaryotic...
With an overwhelming amount of genetic data now becoming publicly available, there is a growing need...
Summary: Genomes of emerging model organisms are now being sequenced at very low cost. However, obta...
While the genomes of many organisms have been sequenced over the last few years, transforming such r...
The availability of whole genome sequence data presents an opportunity to improve the accuracy of ge...
Next-generation sequencing has generated enormous amount of DNA and RNA sequences that potentially c...
Consistency modeling for gene selection is a new topic emerging from recent cancer bioinformatics re...
Predicting protein-coding genes still remains a significant challenge. Although a variety of computa...
Automatic gene prediction is one of the major challenges in computational sequence analysis. Traditi...
Michael E. Wall is with Los Alamos National Laboratory, Sindhu Raghavan is with UT Austin and Los Al...
Abstract Background Evolutionary divergence in the position of the translational start site among or...
<div><p>Nowadays, prokaryotic genomes are sequenced faster than the capacity to manually curate gene...
Nowadays, prokaryotic genomes are sequenced faster than the capacity to manually curate gene annotat...
This paper is supposed to bridge the gap between practical experience in using GeneMark for a rapidl...
Nowadays, prokaryotic genomes are sequenced faster than the capacity to manually curate gene annotat...
Modern gene location prediction techniques are able to achieve near-perfect accuracy for prokaryotic...
With an overwhelming amount of genetic data now becoming publicly available, there is a growing need...
Summary: Genomes of emerging model organisms are now being sequenced at very low cost. However, obta...
While the genomes of many organisms have been sequenced over the last few years, transforming such r...
The availability of whole genome sequence data presents an opportunity to improve the accuracy of ge...
Next-generation sequencing has generated enormous amount of DNA and RNA sequences that potentially c...
Consistency modeling for gene selection is a new topic emerging from recent cancer bioinformatics re...
Predicting protein-coding genes still remains a significant challenge. Although a variety of computa...
Automatic gene prediction is one of the major challenges in computational sequence analysis. Traditi...