This master's thesis consider clustering of protein sequences based on primary structure of proteins. Studies the protein sequences from they primary structure. Describes methods for similarities in the amino acid sequences of proteins, cluster analysis and clustering algorithms. This thesis presents concept of distance function based on similarity of protein sequences and implements clustering algorithms ANGES, k-means, k-medoids in Python programming language
Background: Genome-sequencing projects are currently producing an enormous amount of new sequences a...
The sizes of the protein databases are growing rapidly nowadays, thus it becomes increasingly import...
Protein structural annotation and classification is an important and challenging problem in bioinfor...
One of the main reasons for protein clustering is prediction of structure, function and evolution. M...
Protein sequence motifs are short conserved subsequences common to related protein sequences. Inform...
Protein sequences clustering based on their sequence patterns has attracted lots of research efforts...
In this paper, a technique to reduce time and space during protein sequence clustering and classific...
Protein tertiary structure plays a very important role in determining its possible functional sites ...
We analyze all known protein sequences in search for a global map of protein space that is consisten...
An important problem in genomics is automatically clustering homologous proteins when only sequence ...
Abstract Background The sequencing of the human genome has enabled us to access a comprehensive list...
International audienceWe present a thorough analysis of the relation between amino acid sequence and...
The paper focuses on the development of a software tool for protein clustering according to their am...
An important problem in genomics is automatic-ally clustering homologous proteins when only sequence...
Biological research has generated vast quantities of protein sequences. One of the current outstandi...
Background: Genome-sequencing projects are currently producing an enormous amount of new sequences a...
The sizes of the protein databases are growing rapidly nowadays, thus it becomes increasingly import...
Protein structural annotation and classification is an important and challenging problem in bioinfor...
One of the main reasons for protein clustering is prediction of structure, function and evolution. M...
Protein sequence motifs are short conserved subsequences common to related protein sequences. Inform...
Protein sequences clustering based on their sequence patterns has attracted lots of research efforts...
In this paper, a technique to reduce time and space during protein sequence clustering and classific...
Protein tertiary structure plays a very important role in determining its possible functional sites ...
We analyze all known protein sequences in search for a global map of protein space that is consisten...
An important problem in genomics is automatically clustering homologous proteins when only sequence ...
Abstract Background The sequencing of the human genome has enabled us to access a comprehensive list...
International audienceWe present a thorough analysis of the relation between amino acid sequence and...
The paper focuses on the development of a software tool for protein clustering according to their am...
An important problem in genomics is automatic-ally clustering homologous proteins when only sequence...
Biological research has generated vast quantities of protein sequences. One of the current outstandi...
Background: Genome-sequencing projects are currently producing an enormous amount of new sequences a...
The sizes of the protein databases are growing rapidly nowadays, thus it becomes increasingly import...
Protein structural annotation and classification is an important and challenging problem in bioinfor...