We clustered 8.76 M protein sequences deduced from 2,307 completely sequenced Proteobacterial genomes, resulting in 707,311 clusters of one or more sequences, of which 224,442 ranged in size from 2 to 2,894 sequences. To our knowledge, this is the first study of this scale. We were surprised to find that no single cluster contained a representative sequence from every organism in the study. Given the minimal genome concept, we had expected to find a shared set of proteins. To determine why the clusters did not have universal representation, we chose four essential proteins representing fundamental cellular functions: the chaperonin GroEL, the DNA-dependent RNA polymerase subunits beta and beta′ (RpoB/RpoB′), and DNA polymerase I (PolA), and examine...
The functional repertoire of a cell is largely embodied in its proteome, the collection of proteins ...
Abstract Background Current protein clustering methods rely on either sequence or functional similar...
Abstract Background The increasing availability of whole genome sequences allows the gene or protein...
Thousands of whole-genome and whole-proteome sequences have been made available through advances in ...
Abstract Proteome-scale bioinformatics research is increasingly conducted as the number of completel...