<p>Reduced amino acid alphabets are used to show similarity in sequence between two proteins that may not otherwise be detected. This can occur if slight changes or mutations have occurred over time through evolution and divergence of species. In this case the structure would remain similar, but the sequence would appear to be much different. It is important to explore different methods in which to determine the reduced amino acid alphabet that produces the most accurate results. Here we describe a program we created to convert individual protein sequences and entire databases to a user-defined reduced amino acid alphabet. An optimal reduced alphabet would need to conserve the fold of the original protein while also showing similarities in ...
One of the fundamental challenges in computational biology is the identification of evolutionarily r...
Proteins are macromolecules that play a pivotal role in biological processes in living organisms. St...
The ever increasing number of sequences in protein databases usually turns out large numbers of homo...
Motivation: Many proteins with vastly dissimilar sequences are found to share a common fold, as evid...
International audienceProtein sequence world is considerably larger than structure world. In consequ...
A central problem in creating simplified amino acid alphabets is narrowing down the massive number ...
Database scanning programs such as BLAST and FASTA are used nowadays by most biologists for the post...
Amino acid sequence analysis provides important insight into the structure of proteins, which in tur...
Abstract Background Quantitative descriptions of amino acid similarity, expressed as probabilistic m...
Motivation: Many proteins with vastly dissimilar sequences are found to share a common fold, as evid...
Every year an increasing number of amino acid sequences of proteins are being solved by genome seque...
The structural annotation of proteins with no detectable homologs of known 3D structure identified u...
The mapping of secondary structure elements onto amino acid sequences enhances the quality of alignm...
AbstractAmino acid residues may be divided into groups according to similarity of function, or evolu...
The use of k-word matches was developed as a fast alignment-free comparison method for dna sequences...
One of the fundamental challenges in computational biology is the identification of evolutionarily r...
Proteins are macromolecules that play a pivotal role in biological processes in living organisms. St...
The ever increasing number of sequences in protein databases usually turns out large numbers of homo...
Motivation: Many proteins with vastly dissimilar sequences are found to share a common fold, as evid...
International audienceProtein sequence world is considerably larger than structure world. In consequ...
A central problem in creating simplified amino acid alphabets is narrowing down the massive number ...
Database scanning programs such as BLAST and FASTA are used nowadays by most biologists for the post...
Amino acid sequence analysis provides important insight into the structure of proteins, which in tur...
Abstract Background Quantitative descriptions of amino acid similarity, expressed as probabilistic m...
Motivation: Many proteins with vastly dissimilar sequences are found to share a common fold, as evid...
Every year an increasing number of amino acid sequences of proteins are being solved by genome seque...
The structural annotation of proteins with no detectable homologs of known 3D structure identified u...
The mapping of secondary structure elements onto amino acid sequences enhances the quality of alignm...
AbstractAmino acid residues may be divided into groups according to similarity of function, or evolu...
The use of k-word matches was developed as a fast alignment-free comparison method for dna sequences...
One of the fundamental challenges in computational biology is the identification of evolutionarily r...
Proteins are macromolecules that play a pivotal role in biological processes in living organisms. St...
The ever increasing number of sequences in protein databases usually turns out large numbers of homo...