<p>Cluster size distributions of α/β hydrolases (abH), short-chain dehydrogenases/reductases (SDR), ω-transaminases (oTA), cytochrome P450 monooxygenases (CYP), thiamine diphosphate-dependent decarboxylases (DC), and β-hydroxyacid dehydrogenases/imine reductases (bHAD) follow a power law: N(s) ~ s<sup>-τ</sup> (N(s), number of clusters of size s; τ, Fisher exponent). Cluster criterion: 60% global sequence identity.</p>
<p>A. Hierarchical clustering for the normalized data. Biological replicate samples cluster closely ...
<p>The <i>x</i>-axis is the size of a cluster defined by the number of non-redundant sequences at 90...
<p>The left figure shows the number of clusters by organisms at the level of main domains of life (A...
<p>Distributions of pairwise global sequence identity for the protein families of α/β-hydrolases (ab...
<p>Numbers within the boxes represent the percentage of strains from each group having the complete ...
<p>For each sequence-structure map , generated by potential , plots present the expected size of seq...
<div><p>The <i>x</i>-axis is the logarithm of the cluster size <i>X</i> and the <i>y</i>-a...
<p>(A) Distribution of the coverage rate of CD by H2CD (N/l<sub>CD</sub>). (C) Distribution of the c...
<p>Distribution of the detection breadth (DB) among unified housekeeping (HK) genes (red bar) and th...
<p>Comparison of the relative abundances of 6-mers in the different datasets using hierarchical clust...
<p>A: Percentage of proteins with a certain number of TMHs. Percentage of SC-clusters (B) and HIS cl...
<p>The enzymes are distributed among the compartments according to beta distributions with a common ...
<p>Hierarchical clustering on the ‘core’ dataset (A) as well as on the union (B) of all identified a...
<p>STRUCTURE analysis of the proportion of each isolate’s SNP profile attributed to each of the diff...
<p>Cluster size distributions for (a) constant and and (b) constant and taken from 400 tracer tr...