Proteins are macromolecules that play a pivotal role in biological processes in living organisms. Structural information for proteins is collected in a large Protein Data Bank database, which contains at this time over 122,000 structures [24]. Grouping, or clustering, similar protein sequences based on their similarity allows biologists to identify homologous sequences, or those with shared gene ancestry.The current implementation on the RCSB PDB site (www.rcsb.org) uses BLASTClust [16], which is run weekly to account for the frequent protein data placed into the Protein Data Bank database. The issue is that these updates take about half a day to run. To determine the similarity between pairs of protein sequences, there are methods that ali...
Background: The function of a protein can be deciphered with higher accuracy from its structure than...
Motivation: The Database of known protein structures (PDB) is increasing rapidly. This results in a ...
Computing sequence similarity is a fundamental task in biology, with alignment forming the basis for...
One of the main reasons for protein clustering is prediction of structure, function and evolution. M...
Abstract Background Protein sequence alignment analyses have become a crucial step for many bioinfor...
Abstract—The rapid burgeoning of available protein data makes the use of clustering within families ...
This paper describes a new technique for parallelizing protein clustering, an important bioinformati...
Next-generation sequencing has allowed many new protein sequences to be identified. However, this ex...
ooke.ca We are interested in the problem of grouping families of non-alignable protein sequences, su...
This paper describes a new technique for parallelizing protein clustering, an important bioinformati...
Background Fueled by rapid progress in high-throughput sequencing, the size of public sequence datab...
Background: Genome-sequencing projects are currently producing an enormous amount of new sequences a...
BACKGROUND:The sequencing of the human genome has enabled us to access a comprehensive list of genes...
Background Clustering of protein sequences is of key importance in predicting the structure and fun...
Abstract Background Clustering of protein sequences is of key importance in predicting the structure...
Background: The function of a protein can be deciphered with higher accuracy from its structure than...
Motivation: The Database of known protein structures (PDB) is increasing rapidly. This results in a ...
Computing sequence similarity is a fundamental task in biology, with alignment forming the basis for...
One of the main reasons for protein clustering is prediction of structure, function and evolution. M...
Abstract Background Protein sequence alignment analyses have become a crucial step for many bioinfor...
Abstract—The rapid burgeoning of available protein data makes the use of clustering within families ...
This paper describes a new technique for parallelizing protein clustering, an important bioinformati...
Next-generation sequencing has allowed many new protein sequences to be identified. However, this ex...
ooke.ca We are interested in the problem of grouping families of non-alignable protein sequences, su...
This paper describes a new technique for parallelizing protein clustering, an important bioinformati...
Background Fueled by rapid progress in high-throughput sequencing, the size of public sequence datab...
Background: Genome-sequencing projects are currently producing an enormous amount of new sequences a...
BACKGROUND:The sequencing of the human genome has enabled us to access a comprehensive list of genes...
Background Clustering of protein sequences is of key importance in predicting the structure and fun...
Abstract Background Clustering of protein sequences is of key importance in predicting the structure...
Background: The function of a protein can be deciphered with higher accuracy from its structure than...
Motivation: The Database of known protein structures (PDB) is increasing rapidly. This results in a ...
Computing sequence similarity is a fundamental task in biology, with alignment forming the basis for...