In this paper, a technique to reduce time and space during protein sequence clustering and classification is presented. During training and testing phase, the similarity score value between a pair of sequences is determined by selecting a portion of the sequence instead of the entire sequence. It is like selecting a subset of features for sequence data sets. The experimental results of the proposed method show that the classification accuracy (CA) using the prototypes generated/used does not degrade much but the training and testing time are reduced significantly. Thus the experimental results indicate that the similarity score need not be calculated by considering the entire length of the sequence for achieving a good CA. Even space requir...
Biological research has generated vast quantities of protein sequences. One of the current outstandi...
An important problem in genomics is automatically clustering homologous proteins when only sequence ...
In this paper we investigate the usage of a clustering algorithm as a feature extraction technique t...
In this paper, a technique to reduce time and space during protein sequence clustering and classific...
In this paper, an efficient K-medians clustering (unsupervised) algorithm for prototype selection an...
One of the main reasons for protein clustering is prediction of structure, function and evolution. M...
Clustering is the division of data into groups of similar objects. The main objective of this unsupe...
International audienceBackground: An important problem in computational biology is the automatic det...
This master's thesis consider clustering of protein sequences based on primary structure of proteins...
The sizes of the protein databases are growing rapidly nowadays, thus it becomes increasingly import...
This paper describes a new technique for parallelizing protein clustering, an important bioinformati...
Background: Genome-sequencing projects are currently producing an enormous amount of new sequences a...
This paper describes a new technique for parallelizing protein clustering, an important bioinformati...
Abstract. We study the problem of efficiently clustering protein se-quences in a limited information...
Protein sequences clustering based on their sequence patterns has attracted lots of research efforts...
Biological research has generated vast quantities of protein sequences. One of the current outstandi...
An important problem in genomics is automatically clustering homologous proteins when only sequence ...
In this paper we investigate the usage of a clustering algorithm as a feature extraction technique t...
In this paper, a technique to reduce time and space during protein sequence clustering and classific...
In this paper, an efficient K-medians clustering (unsupervised) algorithm for prototype selection an...
One of the main reasons for protein clustering is prediction of structure, function and evolution. M...
Clustering is the division of data into groups of similar objects. The main objective of this unsupe...
International audienceBackground: An important problem in computational biology is the automatic det...
This master's thesis consider clustering of protein sequences based on primary structure of proteins...
The sizes of the protein databases are growing rapidly nowadays, thus it becomes increasingly import...
This paper describes a new technique for parallelizing protein clustering, an important bioinformati...
Background: Genome-sequencing projects are currently producing an enormous amount of new sequences a...
This paper describes a new technique for parallelizing protein clustering, an important bioinformati...
Abstract. We study the problem of efficiently clustering protein se-quences in a limited information...
Protein sequences clustering based on their sequence patterns has attracted lots of research efforts...
Biological research has generated vast quantities of protein sequences. One of the current outstandi...
An important problem in genomics is automatically clustering homologous proteins when only sequence ...
In this paper we investigate the usage of a clustering algorithm as a feature extraction technique t...