The classification of protein sequences provides valuable insights into bioinformatics. Most existing methods are based on sequence alignment algorithms, which become time-consuming as the size of the database increases. Therefore, there is a need to develop an improved method for effectively classifying protein sequences. In this paper, we propose a novel accumulated natural vector method to cluster protein sequences at a lower time cost without reducing accuracy. Our method projects each protein sequence as a point in a 250-dimensional space according to its amino acid distribution. Thus, the biological distance between any two proteins can be easily measured by the Euclidean distance between the corresponding points in the 250-dimensiona...
In this paper, a technique to reduce time and space during protein sequence clustering and classific...
Biological sequence analysis is at the core of bioinformatics, bringing together several fields, fro...
Predicting protein structure and function from amino acid sequences is a central aim of bioinformati...
Protein sequences vary in their length and are not readily amenable to conventional data mining tech...
One of the main reasons for protein clustering is prediction of structure, function and evolution. M...
Protein sequences clustering based on their sequence patterns has attracted lots of research efforts...
Abstract Background The sequencing of the human genome has enabled us to access a comprehensive list...
This master's thesis consider clustering of protein sequences based on primary structure of proteins...
To classify proteins into functional families based on their primary sequences, existing classificat...
Abstract—The rapid burgeoning of available protein data makes the use of clustering within families ...
In this paper, we have proposed a novel alignment-free method for comparing the similarity of protei...
<div><p>In this paper, we have proposed a novel alignment-free method for comparing the similarity o...
A global classification of all currently known protein sequences is performed. Every protein sequenc...
ooke.ca We are interested in the problem of grouping families of non-alignable protein sequences, su...
Protein sequence motifs are short conserved subsequences common to related protein sequences. Inform...
In this paper, a technique to reduce time and space during protein sequence clustering and classific...
Biological sequence analysis is at the core of bioinformatics, bringing together several fields, fro...
Predicting protein structure and function from amino acid sequences is a central aim of bioinformati...
Protein sequences vary in their length and are not readily amenable to conventional data mining tech...
One of the main reasons for protein clustering is prediction of structure, function and evolution. M...
Protein sequences clustering based on their sequence patterns has attracted lots of research efforts...
Abstract Background The sequencing of the human genome has enabled us to access a comprehensive list...
This master's thesis consider clustering of protein sequences based on primary structure of proteins...
To classify proteins into functional families based on their primary sequences, existing classificat...
Abstract—The rapid burgeoning of available protein data makes the use of clustering within families ...
In this paper, we have proposed a novel alignment-free method for comparing the similarity of protei...
<div><p>In this paper, we have proposed a novel alignment-free method for comparing the similarity o...
A global classification of all currently known protein sequences is performed. Every protein sequenc...
ooke.ca We are interested in the problem of grouping families of non-alignable protein sequences, su...
Protein sequence motifs are short conserved subsequences common to related protein sequences. Inform...
In this paper, a technique to reduce time and space during protein sequence clustering and classific...
Biological sequence analysis is at the core of bioinformatics, bringing together several fields, fro...
Predicting protein structure and function from amino acid sequences is a central aim of bioinformati...