The amino acid sequence of a protein may be decomposed into consecutive overlapping strings of length K. How unique is the converse, i.e., the reconstruction of an amino acid sequence from the set of K-strings obtained in the decomposition? This problem may be transformed into that of counting the number of Eulerian loops in an Euler graph, though the well-known formula must be modified. By exhaustive enumeration and by using the modified formula, we show that the reconstruction is unique for K equal to or greater than 5 for an overwhelming majority of the proteins in the PDB.seq database. The corresponding Euler graphs provide a means to study the structure of repeated segments in protein sequences.
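The decomposition and the exhaustive enumeration mentioned above can be illustrated with a minimal sketch. It is not the authors' implementation and it does not reproduce the modified Eulerian-loop formula. The function names `k_strings` and `count_reconstructions` are hypothetical, and the sketch assumes that the K-strings are kept with their multiplicities and that every reconstruction must start with the same first K-1 letters as the original sequence.

```python
from collections import Counter


def k_strings(seq, k):
    """Return the consecutive overlapping substrings of length k."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def count_reconstructions(seq, k):
    """Count distinct sequences that share seq's multiset of k-strings.

    Assumption of this sketch: reconstructions start with the same
    first k-1 letters as seq.
    """
    # Each k-string is an arc from its (k-1)-prefix to its (k-1)-suffix.
    arcs = Counter(k_strings(seq, k))
    successors = {}
    for arc in arcs:
        successors.setdefault(arc[:-1], set()).add(arc)

    def walk(node, remaining):
        # Every arc has been used exactly as often as it occurs in seq.
        if remaining == 0:
            return 1
        total = 0
        for arc in successors.get(node, ()):
            if arcs[arc] > 0:
                arcs[arc] -= 1          # use one copy of this arc
                total += walk(arc[1:], remaining - 1)
                arcs[arc] += 1          # backtrack
        return total

    return walk(seq[:k - 1], sum(arcs.values()))
```

For the toy sequence `"ABACABA"`, `count_reconstructions("ABACABA", 2)` gives 3 (the sequences ABACABA, ABABACA and ACABABA), while `count_reconstructions("ABACABA", 3)` gives 1, so the reconstruction becomes unique once K is large enough. The search is exponential in the worst case and recursive, so it is only meant for short illustrative inputs.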
Edited in cooperation with Robert Mercaş. Strings (aka sequences or words) form the most basic and nat...
A novel scheme is introduced to capture the spatial correlations of consecutive amino acids in n...
This paper presents a method for classifying a large and mixed set of uncharacterized sequen...
All known terrestrial proteins are coded as continuous strings of ≈20 amino acids. The patterns form...
Sequencing by hybridization is a method of reconstructing a long DNA string - that is, figuring out ...
BACKGROUND: Protein loops encompass 50% of protein residues in available three-dimensional structure...
We seek to understand the interplay between amino acid sequence and local structure in proteins. Are...
Duplications play a major role in protein evolution and result in intragenic repeats found in about ...
Some natural proteins display recurrent structural patterns. Despite being highly similar at the ter...
The notion of energy landscapes provides conceptual tools for understanding the complexities of prot...
Proteins containing amino acid repeats are considered to be of great importance in evolutionary stud...
In recent years, identification of sequence patterns has been given immense importance to understand...
The amino acid sequences of proteins determine their three-dimensional structures and functions. How...