Biological sequence classification (such as protein remote homology detection) solely based on sequence data is an important problem in computational biology, especially in the current genomics era, when large amount of sequence data are becoming available. Support vector machines (SVMs) based on mismatch string kernels were previously applied to solve this problem, achieving reasonable success. However, they still perform poorly on difficult protein families. In this paper, we propose two approaches to solve the protein remote homology detection problem: one uses a convex combination of random-walk kernels to approximate the random-walk kernel with the optimal random steps, and the other constructs an empirical-map kernel using a profile k...
The classification of protein sequences using string kernels provides valuable insights for protein ...
The classification of protein sequences using string kernels provides valuable insights for protein ...
A new method for detecting remote protein homologies is introduced and shown to perform well in clas...
Determining protein sequence similarity is an important task for protein classification and homology...
Motivation Classification of proteins sequences into functional and structural families based on seq...
Motivation: Classification of proteins sequences into functional and structural families based on se...
Motivation: Remote homology detection between protein se-quences is a central problem in computation...
International audienceMOTIVATION: Remote homology detection between protein sequences is a central p...
Remote homology detection between protein sequences is a central problem in computational biology. D...
We introduce a class of string kernels, called mismatch kernels, for use with support vector machine...
We introduce a class of string kernels, called mismatch kernels, for use with support vector machin...
We introduce a class of string kernels, called mismatch kernels, for use with support vector machine...
Abstract Background Classification of protein sequences is a central problem in computational biolog...
A central problem in computational biology is the classification of related proteins into functional...
Two new techniques for remote protein homology detection particulary suited for sparse data are intr...
The classification of protein sequences using string kernels provides valuable insights for protein ...
The classification of protein sequences using string kernels provides valuable insights for protein ...
A new method for detecting remote protein homologies is introduced and shown to perform well in clas...
Determining protein sequence similarity is an important task for protein classification and homology...
Motivation Classification of proteins sequences into functional and structural families based on seq...
Motivation: Classification of proteins sequences into functional and structural families based on se...
Motivation: Remote homology detection between protein se-quences is a central problem in computation...
International audienceMOTIVATION: Remote homology detection between protein sequences is a central p...
Remote homology detection between protein sequences is a central problem in computational biology. D...
We introduce a class of string kernels, called mismatch kernels, for use with support vector machine...
We introduce a class of string kernels, called mismatch kernels, for use with support vector machin...
We introduce a class of string kernels, called mismatch kernels, for use with support vector machine...
Abstract Background Classification of protein sequences is a central problem in computational biolog...
A central problem in computational biology is the classification of related proteins into functional...
Two new techniques for remote protein homology detection particulary suited for sparse data are intr...
The classification of protein sequences using string kernels provides valuable insights for protein ...
The classification of protein sequences using string kernels provides valuable insights for protein ...
A new method for detecting remote protein homologies is introduced and shown to perform well in clas...