Background: The number of protein family members defined by DNA sequencing is usually much larger than the number characterised experimentally. This paper describes a method to divide protein families into subtypes purely on sequence criteria. Comparison with experimental data allows an independent test of the quality of the clustering. Results: An evolutionary split statistic is calculated for each column in a protein multiple sequence alignment; the statistic has a larger value when a column is better described by an evolutionary model that assumes clustering around two or more amino acids rather than a single amino acid. The user selects columns (typically the top-ranked columns) to construct a motif. The motif is used to divide the f...
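The workflow this abstract outlines (score each alignment column, pick the top-ranked columns as a motif, split the family by motif) can be illustrated with a minimal Python sketch. The scoring function below is NOT the paper's evolutionary split statistic: it is a hypothetical stand-in, a simple log-likelihood ratio between a "two dominant residues" model and a "one dominant residue" model with an assumed error rate `eps`. The function names, the toy alignment, and the treatment of every character (including gaps) as an ordinary symbol are all assumptions made for illustration.

```python
import math
from collections import Counter


def column_split_score(column, eps=0.05):
    """Toy stand-in for a split statistic (an assumption, not the paper's model).

    Returns the log-likelihood of a two-centre model minus that of a
    one-centre model. Larger values mean the column looks more like it
    clusters around two residues than around one.
    """
    counts = Counter(column)
    n = len(column)
    ranked = counts.most_common(2)
    if len(ranked) < 2:
        return 0.0  # fully conserved column: no evidence of a split
    (_, c1), (_, c2) = ranked
    # One-centre model: the top residue has probability 1 - eps,
    # each of the 19 other amino acids has probability eps / 19.
    ll_one = c1 * math.log(1 - eps) + (n - c1) * math.log(eps / 19)
    # Two-centre model: the top two residues share 1 - eps in proportion
    # to their counts; each of the 18 other amino acids has eps / 18.
    w1, w2 = c1 / (c1 + c2), c2 / (c1 + c2)
    ll_two = (c1 * math.log((1 - eps) * w1)
              + c2 * math.log((1 - eps) * w2)
              + (n - c1 - c2) * math.log(eps / 18))
    return ll_two - ll_one


def rank_columns(alignment):
    """Return column indices sorted by decreasing toy split score."""
    columns = list(zip(*alignment))
    scores = [column_split_score(col) for col in columns]
    return sorted(range(len(columns)), key=lambda i: scores[i], reverse=True)


def split_by_motif(alignment, motif_columns):
    """Group sequence indices by their residues at the motif columns."""
    groups = {}
    for idx, seq in enumerate(alignment):
        key = "".join(seq[c] for c in motif_columns)
        groups.setdefault(key, []).append(idx)
    return groups


# Hypothetical six-sequence alignment (equal-length rows).
alignment = ["MKDLA", "MKDLA", "MKDIA", "MREIA", "MREIA", "MREIA"]
motif = rank_columns(alignment)[:2]        # user picks the two top-ranked columns
subtypes = split_by_motif(alignment, motif)
print(motif, subtypes)
```

In this toy alignment, the two columns that separate the sequences cleanly into halves rank highest, and grouping by the residues at those columns yields two candidate subtypes. A real analysis would replace `column_split_score` with the published statistic and validate the resulting groups against experimental data, as the abstract describes.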
Here we assessed the use of domain families for predicting the functions of whole proteins. These 'f...
The completion of a substantial number of complete genome sequencing initiatives has produced more t...
Classifying proteins into subgroups with similar molecular function on the basis of sequence is an i...
Background: The number of protein family members defined by DNA sequencing is ...
Background: Phylogenetic analysis can be used to divide a protein family into subfamilies in the abs...
Background: Searching a biological sequence database with a query sequence looking for homologues ha...
Background: Clustering sequences into groups of putative homologs (families) is a critical first ste...
Next-generation sequencing has allowed many new protein sequences to be identified. However, this ex...
Krause A, Stoye J, Vingron M. Large scale hierarchical clustering of protein sequences. BMC Bioinfor...
Background: The identification of subfamilies within a protein family is a challenging problem...