An important problem in genomics is automatically clustering homologous proteins when only sequence information is available. Most methods for clustering proteins are local: they simply threshold a measure related to sequence distance. We first show how locality limits the performance of such methods by analysing the distribution of distances between protein sequences. We then present a global method based on spectral clustering and provide a theoretical justification for why it should improve markedly on local methods. We extensively tested our method and compared its performance with that of local methods on several subsets of the SCOP (Structural Classification of Proteins) database, a gold standard for protein ...
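The contrast between a local, threshold-based method and a global, spectral one can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the function names `threshold_clusters` and `spectral_bipartition` are hypothetical, the toy similarity matrix `S` is invented, and the spectral step is a plain two-way split on the second eigenvector of the normalized graph Laplacian, which may differ from the authors' actual procedure.

```python
import numpy as np


def threshold_clusters(S, tau):
    """Local baseline: connected components of the graph that keeps
    an edge (i, j) whenever the similarity S[i, j] is at least tau."""
    n = S.shape[0]
    labels = [-1] * n
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if labels[j] == -1 and S[i, j] >= tau:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels


def spectral_bipartition(S):
    """Global two-way split: sign of the second eigenvector of the
    symmetric normalized Laplacian built from all pairwise similarities."""
    W = np.array(S, dtype=float)
    np.fill_diagonal(W, 0.0)                 # ignore self-similarity
    d_inv_sqrt = 1.0 / np.sqrt(W.sum(axis=1))
    L = np.eye(len(W)) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]
    _, vecs = np.linalg.eigh(L)              # eigenvalues in ascending order
    return [int(v > 0) for v in vecs[:, 1]]


# Toy data (invented): two tight groups of three sequences each,
# joined by a single moderately similar pair (2, 3).
S = np.full((6, 6), 0.05)
S[:3, :3] = 0.9
S[3:, 3:] = 0.9
S[2, 3] = S[3, 2] = 0.6
np.fill_diagonal(S, 0.0)

print("threshold:", threshold_clusters(S, 0.5))
print("spectral: ", spectral_bipartition(S))
```

In this toy case the single bridging pair is enough for the threshold method to chain both groups into one cluster, while the spectral split, which weighs all pairwise similarities at once, keeps the two groups apart. Which of the two groups receives label 0 or 1 is arbitrary, because the sign of an eigenvector is not determined.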
This paper describes a new technique for parallelizing protein clustering, an important bioinformati...
Remote homology detection among proteins utilizing only the unlabelled sequences is a central proble...
Background: Searching a biological sequence database with a query sequence looking for homologues ha...
Background: An important problem in computational biology is the automatic det...
Evaluation and improvements of clustering algorithms for detecting remote homologous protein familie...
Background: An important problem in genomics is the automatic inference of groups of homologous prote...
The sizes of the protein databases are growing rapidly nowadays, thus it becomes increasingly import...
One of the main reasons for protein clustering is prediction of structure, function and evolution. M...
Next-generation sequencing has allowed many new protein sequences to be identified. However, this ex...
Copyright information: Taken from "Spectral clustering of protein sequences" N...