Background: Phylogenetic analysis can be used to divide a protein family into subfamilies in the absence of experimental information. Most phylogenetic analysis methods utilize multiple alignment of sequences and are based on an evolutionary model. However, multiple alignment is not an automated procedure and requires human intervention to maintain alignment integrity and to produce phylogenies consistent with the functional splits in underlying sequences. To address this problem, we propose to use the alignment-free Relative Complexity Measure (RCM) combined with reduced amino acid alphabets to cluster protein families into functional subtypes purely on sequence criteria. Comparison with an alignment-based approach was also carried out to ...
With the rapid development of genome sequencing technologies, complete genomes are becoming more ava...
Phylogenetic analysis of proteins using multiple sequence alignment (MSA) assumes an underlying evol...
Abstract Background New computational resources are needed to manage the increasing volume of biolog...
Background: Phylogenetic analysis can be used to divide a protein family into subfamilies in the abs...
Background: Phylogenetic analysis can be used to divide a protein family into subfamilies in the abs...
Background: The number of protein family members defined by DNA sequencing is usually much larger t...
International audienceBACKGROUND: The number of protein family members defined by DNA sequencing is ...
International audienceBACKGROUND: The number of protein family members defined by DNA sequencing is ...
International audienceBACKGROUND: The number of protein family members defined by DNA sequencing is ...
International audienceBACKGROUND: The number of protein family members defined by DNA sequencing is ...
---> Background: The identification of subfamilies within a protein family is a challenging problem...
Rapid advances in sequencing technology have resulted in the availability of genomes from organisms ...
Abstract—The rapid burgeoning of available protein data makes the use of clustering within families ...
Rapid advances in sequencing technology have resulted in the availability of genomes from organisms ...
Background: Searching a biological sequence database with a query sequence looking for homologues ha...
With the rapid development of genome sequencing technologies, complete genomes are becoming more ava...
Phylogenetic analysis of proteins using multiple sequence alignment (MSA) assumes an underlying evol...
Abstract Background New computational resources are needed to manage the increasing volume of biolog...
Background: Phylogenetic analysis can be used to divide a protein family into subfamilies in the abs...
Background: Phylogenetic analysis can be used to divide a protein family into subfamilies in the abs...
Background: The number of protein family members defined by DNA sequencing is usually much larger t...
International audienceBACKGROUND: The number of protein family members defined by DNA sequencing is ...
International audienceBACKGROUND: The number of protein family members defined by DNA sequencing is ...
International audienceBACKGROUND: The number of protein family members defined by DNA sequencing is ...
International audienceBACKGROUND: The number of protein family members defined by DNA sequencing is ...
---> Background: The identification of subfamilies within a protein family is a challenging problem...
Rapid advances in sequencing technology have resulted in the availability of genomes from organisms ...
Abstract—The rapid burgeoning of available protein data makes the use of clustering within families ...
Rapid advances in sequencing technology have resulted in the availability of genomes from organisms ...
Background: Searching a biological sequence database with a query sequence looking for homologues ha...
With the rapid development of genome sequencing technologies, complete genomes are becoming more ava...
Phylogenetic analysis of proteins using multiple sequence alignment (MSA) assumes an underlying evol...
Abstract Background New computational resources are needed to manage the increasing volume of biolog...