The multiple species de novo gene prediction problem can be stated as follows: given an alignment of genomic sequences from two or more organisms, predict the location and structure of all protein-coding genes in one or more of the sequences. Here, we present a new system, N-SCAN (a.k.a. TWINSCAN 3.0), for addressing this problem. N-SCAN can model the phylogenetic relationships between the aligned genome sequences, context-dependent substitution rates, and insertions and deletions. An implementation of N-SCAN was created and used to generate predictions for the entire human genome and the genome of the fruit fly Drosophila melanogaster. Analyses of the predictions reveal that N-SCAN’s accuracy in both human and fly exceeds that of all previ...
Stoye J, Evers D, Meyer F. Generating benchmarks for multiple sequence alignments and phylogenetic r...
With the rapid development of genome sequencing, an ever-increasing number of molecular biology anal...
A central focus of computational biology is to organize and make use of vast stores of molecular seq...
[[abstract]]Identifying protein coding genes is one of most important task in newly sequenced genome...
Given the increasing number of available genomic sequences, one now faces the task of identifying th...
Automatic gene prediction is one of the major challenges in computational sequence analysis. Traditi...
Taher L, Rinner O, Garg S, Sczyrba A, Morgenstern B. AGenDA: gene prediction by cross-species sequen...
ABSTRACT As whole genome sequencing is taking on ever-increasing dimensions, the new challenge is th...
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Compute...
Whole genome alignments have become a central tool in biological sequence analy-sis. A major applica...
Abstract: Current methods for high-throughput automatic annotation of newly sequenced genomes are la...
Predicting protein-coding genes still remains a significant challenge. Although a variety of computa...
We performed benchmarks of phylogenetic grammar-based ncRNA gene prediction, experimenting with eigh...
[[abstract]]GeneAlign is a coding exon prediction tool for predicting protein coding genes by measur...
We performed benchmarks of phylogenetic grammar-based ncRNA gene prediction, experimenting with eigh...
Stoye J, Evers D, Meyer F. Generating benchmarks for multiple sequence alignments and phylogenetic r...
With the rapid development of genome sequencing, an ever-increasing number of molecular biology anal...
A central focus of computational biology is to organize and make use of vast stores of molecular seq...
[[abstract]]Identifying protein coding genes is one of most important task in newly sequenced genome...
Given the increasing number of available genomic sequences, one now faces the task of identifying th...
Automatic gene prediction is one of the major challenges in computational sequence analysis. Traditi...
Taher L, Rinner O, Garg S, Sczyrba A, Morgenstern B. AGenDA: gene prediction by cross-species sequen...
ABSTRACT As whole genome sequencing is taking on ever-increasing dimensions, the new challenge is th...
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Compute...
Whole genome alignments have become a central tool in biological sequence analy-sis. A major applica...
Abstract: Current methods for high-throughput automatic annotation of newly sequenced genomes are la...
Predicting protein-coding genes still remains a significant challenge. Although a variety of computa...
We performed benchmarks of phylogenetic grammar-based ncRNA gene prediction, experimenting with eigh...
[[abstract]]GeneAlign is a coding exon prediction tool for predicting protein coding genes by measur...
We performed benchmarks of phylogenetic grammar-based ncRNA gene prediction, experimenting with eigh...
Stoye J, Evers D, Meyer F. Generating benchmarks for multiple sequence alignments and phylogenetic r...
With the rapid development of genome sequencing, an ever-increasing number of molecular biology anal...
A central focus of computational biology is to organize and make use of vast stores of molecular seq...