We clustered 8.76 M protein sequences deduced from 2,307 completely sequenced Proteobacterial genomes, resulting in 707,311 clusters of one or more sequences, of which 224,442 ranged in size from 2 to 2,894 sequences. To our knowledge, this is the first study of this scale. We were surprised to find that no single cluster contained a representative sequence from all the organisms in the study. Given the minimal genome concept, we expected to find a shared set of proteins. To determine why the clusters did not have universal representation, we chose four essential proteins representing fundamental cellular functions, the chaperonin GroEL, DNA-dependent RNA polymerase subunits beta and beta′ (RpoB/RpoB′), and DNA polymerase I (PolA), and examine...
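The two checks described in the abstract, counting singleton versus multi-member clusters and testing whether any cluster has a representative from every genome, can be illustrated with a minimal sketch. This is not the authors' pipeline: the function name `summarize_clusters`, the input layout (a mapping from cluster ID to `(genome_id, protein_id)` pairs), and the toy data are all assumptions made for illustration.

```python
from collections import Counter


def summarize_clusters(clusters, all_genomes):
    """Summarize a clustering result (hypothetical input layout).

    clusters: dict mapping cluster ID -> list of (genome_id, protein_id) pairs.
    all_genomes: iterable of every genome ID included in the study.
    """
    genome_set = set(all_genomes)

    # Histogram of cluster sizes: size -> number of clusters of that size.
    size_counts = Counter(len(members) for members in clusters.values())

    # A cluster is "universal" if its members cover every genome in the study.
    universal = [
        cluster_id
        for cluster_id, members in clusters.items()
        if {genome_id for genome_id, _ in members} >= genome_set
    ]

    return {
        "total": len(clusters),
        "singletons": size_counts.get(1, 0),
        "multi": sum(n for size, n in size_counts.items() if size >= 2),
        "largest": max(size_counts, default=0),
        "universal": universal,
    }


# Toy data (invented): three genomes, three clusters.
toy_clusters = {
    "c1": [("g1", "p1"), ("g2", "p2"), ("g3", "p3")],
    "c2": [("g1", "p4"), ("g1", "p5")],
    "c3": [("g2", "p6")],
}
summary = summarize_clusters(toy_clusters, ["g1", "g2", "g3"])
print(summary)
```

In the toy data only `c1` covers all three genomes; the abstract reports that, at the scale of 2,307 genomes, no cluster met this criterion.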
The functional repertoire of a cell is largely embodied in its proteome, the collection of proteins ...
Background: Current protein clustering methods rely on either sequence or functional similar...
Next-generation sequencing has allowed many new protein sequences to be identified. However, this ex...
Proteome-scale bioinformatics research is increasingly conducted as the number of completel...
Thousands of whole-genome and whole-proteome sequences have been made available through advances in ...