One of the most accurate multi-class protein classification systems continues to be the profile-based SVM kernel introduced by the Leslie group. Unfortunately, its CPU requirements render it too slow for practical applications of large-scale classification tasks. Here, we introduce several software improvements that enable significant acceleration. Using various non-redundant data sets, we demonstrate that our new implementation reaches a maximal speed-up as high as 14-fold for calculating the same kernel matrix. Some predictions are over 200 times faster and render the kernel as possibly the top contender in a low ratio of speed/performance. Additionally, we explain how to parallelize various computations and provide an integrative program...
fast_protein_cluster is a fast, parallel and memory efficient package used to cluster 60 000 sets of...
Determining protein sequence similarity is an important task for protein classification and homology...
For classification tasks, such as protein-protein interactions (PPI), support vector machines (SVMs)...
One of the most accurate multi-class protein classification systems continues to be the profile-base...
One of the most accurate multi-class protein classification systems continues to be the profile-base...
In recent years, more and more high-throughput data sources useful for protein complex prediction ha...
Support vector machines (SVM) have been successfully used to classify proteins into functional categ...
Two new techniques for remote protein homology detection particulary suited for sparse data are intr...
Kernel-based machine learning algorithms are versatile tools for biological sequence data analysis. ...
Determining protein sequence similarity is an important task for protein classification and homology...
We describe several families of k-mer based string kernels related to the recently presented mismatc...
Background: Predicting a protein's structural class from its amino acid sequence is a fundamental pr...
Motivation: fast_protein_cluster is a fast, parallel and memory efficient package used to cluster 60...
A learning task with thousands of training examples in Support Vector Machine (SVM) demands large am...
With the increasing power of computers, the amount of data that can be processed in small periods of...
fast_protein_cluster is a fast, parallel and memory efficient package used to cluster 60 000 sets of...
Determining protein sequence similarity is an important task for protein classification and homology...
For classification tasks, such as protein-protein interactions (PPI), support vector machines (SVMs)...
One of the most accurate multi-class protein classification systems continues to be the profile-base...
One of the most accurate multi-class protein classification systems continues to be the profile-base...
In recent years, more and more high-throughput data sources useful for protein complex prediction ha...
Support vector machines (SVM) have been successfully used to classify proteins into functional categ...
Two new techniques for remote protein homology detection particulary suited for sparse data are intr...
Kernel-based machine learning algorithms are versatile tools for biological sequence data analysis. ...
Determining protein sequence similarity is an important task for protein classification and homology...
We describe several families of k-mer based string kernels related to the recently presented mismatc...
Background: Predicting a protein's structural class from its amino acid sequence is a fundamental pr...
Motivation: fast_protein_cluster is a fast, parallel and memory efficient package used to cluster 60...
A learning task with thousands of training examples in Support Vector Machine (SVM) demands large am...
With the increasing power of computers, the amount of data that can be processed in small periods of...
fast_protein_cluster is a fast, parallel and memory efficient package used to cluster 60 000 sets of...
Determining protein sequence similarity is an important task for protein classification and homology...
For classification tasks, such as protein-protein interactions (PPI), support vector machines (SVMs)...