Based on the SYSTERS protein family database, we present taxon-related protein family frequencies and distributions. A set of taxon-related protein families is a subset of the whole family set with respect to one taxon, where taxon is not restricted to the species level but may be any rank in the taxonomy. We examine eight ranks in the lineages of seven organisms. A strong linear correlation is observed between the total number of different families and the number of sequences in the data set under consideration. We fitted the generalised power-law function to protein family distributions in a least-squares sense excluding singleton frequencies. Taxon-related family distributions tend to have the same shape and a negative slope being not la...
<p>Losses (left-pointing triangles) and gains (right-pointing triangles) are shown as calculated by ...
<div><p>The currently known protein sequences are not distributed equally in sequence space, but clu...
Abstract Background Previous methods of detecting the taxonomic origins of arbitrary sequence collec...
Based on the SYSTERS protein family database, we present taxon-related protein family frequencies an...
A protein family contains sequences that are evolutionarily related. Generally, this is reflected by...
ABSTRACT It has been observed that the size of protein sequence families is unevenly distrib-uted, w...
(A) Phylogenetic heat tree of proteins in FUnkFams generated with Metacoder [10]. Each FUnkFams prot...
<p>For each of the 1,219 domain superfamilies and their profile of abundance in the 38 genomes, we c...
41 pages, 16 figuresInternational audienceAmong several quantitative invariants found in evolutionar...
The completion of a substantial number of complete genome sequencing initiatives has produced more t...
Christine Vogel is with Medical Research Council Laboratory of Molecular Biology and UT Austin, Cyru...
Several proteins that have substantially diverged during evolution retain similar three-dimensional ...
The cross-phylum distribution of the prion-like protein family representative sequences is shown for...
Contains fulltext : 36341.pdf (publisher's version ) (Closed access)The gap betwee...
Abstract Background New computational resources are needed to manage the increasing volume of biolog...
<p>Losses (left-pointing triangles) and gains (right-pointing triangles) are shown as calculated by ...
<div><p>The currently known protein sequences are not distributed equally in sequence space, but clu...
Abstract Background Previous methods of detecting the taxonomic origins of arbitrary sequence collec...
Based on the SYSTERS protein family database, we present taxon-related protein family frequencies an...
A protein family contains sequences that are evolutionarily related. Generally, this is reflected by...
ABSTRACT It has been observed that the size of protein sequence families is unevenly distrib-uted, w...
(A) Phylogenetic heat tree of proteins in FUnkFams generated with Metacoder [10]. Each FUnkFams prot...
<p>For each of the 1,219 domain superfamilies and their profile of abundance in the 38 genomes, we c...
41 pages, 16 figuresInternational audienceAmong several quantitative invariants found in evolutionar...
The completion of a substantial number of complete genome sequencing initiatives has produced more t...
Christine Vogel is with Medical Research Council Laboratory of Molecular Biology and UT Austin, Cyru...
Several proteins that have substantially diverged during evolution retain similar three-dimensional ...
The cross-phylum distribution of the prion-like protein family representative sequences is shown for...
Contains fulltext : 36341.pdf (publisher's version ) (Closed access)The gap betwee...
Abstract Background New computational resources are needed to manage the increasing volume of biolog...
<p>Losses (left-pointing triangles) and gains (right-pointing triangles) are shown as calculated by ...
<div><p>The currently known protein sequences are not distributed equally in sequence space, but clu...
Abstract Background Previous methods of detecting the taxonomic origins of arbitrary sequence collec...