In biological sequence classification, it is common to convert variable-length sequences into fixed-length vectors via pairwise sequence comparison. This pairwise approach, however, can lead to feature vectors with dimension equal to the training set size, causing the curse of dimensionality. This calls for feature selection methods that can weed out irrelevant features to reduce training and recognition time. In this paper, we propose to train an SVM using the full-feature column vectors of a pairwise scoring matrix and select the relevant features based on the support vectors of the SVM. The idea stems from the fact that pairwise scoring matrices are symmetric and support vectors are important for classification. We refer to this approach...
Abstract Background The functions of proteins are closely related to their subcellular locations. In...
Abstract Background The large gap between the number of protein sequences in databases and the numbe...
We propose novel algorithms for solving the so-called Support Vector Multiple Kernel Learning proble...
Learning strategies are traditionally divided into two categories: unsupervised learning and supervi...
effective data mining system lies in the representation of pattern vectors. For many bioinformatic a...
Protein subcellular localization is a crucial ingredient to many important inferences about cellular...
Protein subcellular localization is a crucial ingredient to many important inferences about cellular...
We propose an elegant multiclass prediction approach for protein subcellular localization. First we ...
Abstract: Protein subcellular localization is a crucial ingredient to many important inferences abou...
Multi-label classification has received increasing attention in computational proteomics, especially...
Protein subcellular localization is a crucial ingredient to many important inferences about cellular...
Kernel-based machine learning algorithms are versatile tools for biological sequence data analysis. ...
Abstract. This paper provides a solution to the curse of dimensionality problem in the pairwise scor...
Predicting the destination of a protein in a cell is important for annotating the function of the pr...
A wide variety of methods have been proposed in protein subnuclear localization to improve the predi...
Abstract Background The functions of proteins are closely related to their subcellular locations. In...
Abstract Background The large gap between the number of protein sequences in databases and the numbe...
We propose novel algorithms for solving the so-called Support Vector Multiple Kernel Learning proble...
Learning strategies are traditionally divided into two categories: unsupervised learning and supervi...
effective data mining system lies in the representation of pattern vectors. For many bioinformatic a...
Protein subcellular localization is a crucial ingredient to many important inferences about cellular...
Protein subcellular localization is a crucial ingredient to many important inferences about cellular...
We propose an elegant multiclass prediction approach for protein subcellular localization. First we ...
Abstract: Protein subcellular localization is a crucial ingredient to many important inferences abou...
Multi-label classification has received increasing attention in computational proteomics, especially...
Protein subcellular localization is a crucial ingredient to many important inferences about cellular...
Kernel-based machine learning algorithms are versatile tools for biological sequence data analysis. ...
Abstract. This paper provides a solution to the curse of dimensionality problem in the pairwise scor...
Predicting the destination of a protein in a cell is important for annotating the function of the pr...
A wide variety of methods have been proposed in protein subnuclear localization to improve the predi...
Abstract Background The functions of proteins are closely related to their subcellular locations. In...
Abstract Background The large gap between the number of protein sequences in databases and the numbe...
We propose novel algorithms for solving the so-called Support Vector Multiple Kernel Learning proble...