Background: A better understanding of the size and abundance of open reading frames (ORFS) in whole genomes may shed light on the factors that control genome complexity. Here we examine the statistical distributions of open reading frames (i.e. distribution of start and stop codons) in the fully sequenced genomes of 297 prokaryotes, and 14 eukaryotes. Methodology/Principal Findings: By fitting mixture models to data from whole genome sequences we show that the size-frequency distributions for ORFS are strikingly similar across prokaryotic and eukaryotic genomes. Moreover, we show that i) a large fraction (60–80%) of ORF size-frequency distributions can be predicted a priori with a stochastic assembly model based on GC content, and that (ii)...
Repeats or Transposable Elements (TEs) are highly repeated sequence stretches, present in virtually ...
Motivation: Obtaining an accurate estimation of the number of protein coding genes and the number of...
Xia et al. (2003) discuss whether the lengths of exons in eukaryotes and of genes in prokaryotes var...
BACKGROUND: A better understanding of the size and abundance of open reading frames (ORFS) in whole ...
Background: A better understanding of the size and abundance of open reading frames (ORFS) in whole ...
Some statistical properties of open reading frames in all currently available complete genome sequen...
An analytical model based on the statistical properties of Open Reading Frames (ORFs) of eubacterial...
An analytical model based on the statistical properties of Open Reading Frames (ORFs) of eubacterial...
Data on the number of Open Reading Frames (ORFs) coded by genomes from the 3 domains of Life show th...
BACKGROUND: Data on the number of Open Reading Frames (ORFs) coded by genomes from the 3 domains of ...
In this paper, we report an analysis of the protein sequence length distribution for 13 bacteria, fo...
We compare the frequency distribution of gene family sizes in the complete genomes of six bacteria (...
We review fold usage on completed genomes to explore protein structure evolution. The patterns of pr...
Background: Because prokaryotic genomes experience a rapid flux of genes, selection may act at a hig...
Abstract Background Among microbial genomes, genetic information is frequently compressed, exploitin...
Repeats or Transposable Elements (TEs) are highly repeated sequence stretches, present in virtually ...
Motivation: Obtaining an accurate estimation of the number of protein coding genes and the number of...
Xia et al. (2003) discuss whether the lengths of exons in eukaryotes and of genes in prokaryotes var...
BACKGROUND: A better understanding of the size and abundance of open reading frames (ORFS) in whole ...
Background: A better understanding of the size and abundance of open reading frames (ORFS) in whole ...
Some statistical properties of open reading frames in all currently available complete genome sequen...
An analytical model based on the statistical properties of Open Reading Frames (ORFs) of eubacterial...
An analytical model based on the statistical properties of Open Reading Frames (ORFs) of eubacterial...
Data on the number of Open Reading Frames (ORFs) coded by genomes from the 3 domains of Life show th...
BACKGROUND: Data on the number of Open Reading Frames (ORFs) coded by genomes from the 3 domains of ...
In this paper, we report an analysis of the protein sequence length distribution for 13 bacteria, fo...
We compare the frequency distribution of gene family sizes in the complete genomes of six bacteria (...
We review fold usage on completed genomes to explore protein structure evolution. The patterns of pr...
Background: Because prokaryotic genomes experience a rapid flux of genes, selection may act at a hig...
Abstract Background Among microbial genomes, genetic information is frequently compressed, exploitin...
Repeats or Transposable Elements (TEs) are highly repeated sequence stretches, present in virtually ...
Motivation: Obtaining an accurate estimation of the number of protein coding genes and the number of...
Xia et al. (2003) discuss whether the lengths of exons in eukaryotes and of genes in prokaryotes var...