Abstract Background Classification of protein sequences is a central problem in computational biology. Currently, among computational methods discriminative kernel-based approaches provide the most accurate results. However, kernel-based methods often lack an interpretable model for analysis of discriminative sequence features, and predictions on new sequences usually are computationally expensive. Results In this work we present a novel kernel for protein sequences based on average word similarity between two sequences. We show that this kernel gives rise to a feature space that allows analysis of discriminative features and fast classification of new sequences. We demonstrate the performance of our approach on a widely-used benchmark setu...
The understanding of protein functions and there-by characterization is essential to modeling comple...
Effective representation of the protein sequence is a key issue in detecting remote protein homology...
Motivation Classification of proteins sequences into functional and structural families based on seq...
Determining protein sequence similarity is an important task for protein classification and homology...
The automatic classification of protein sequences into families is of great help for the functional ...
International audienceMOTIVATION: Remote homology detection between protein sequences is a central p...
Biological sequence classification (such as protein remote homology detection) solely based on seque...
Remote homology detection between protein sequences is a central problem in computational biology. D...
The classification of protein sequences using string kernels provides valuable insights for protein ...
The classification of protein sequences using string kernels provides valuable insights for protein ...
Abstract Background The challenge of remote homology detection is that many evolutionarily related s...
The classification of protein sequences using string kernels provides valuable insights for protein ...
The classification of protein sequences using string kernels provides valuable insights for protein ...
The amount of the information being churned out by the field of biology has jumped manifold and now ...
A central problem in computational biology is the classification of related proteins into functional...
The understanding of protein functions and there-by characterization is essential to modeling comple...
Effective representation of the protein sequence is a key issue in detecting remote protein homology...
Motivation Classification of proteins sequences into functional and structural families based on seq...
Determining protein sequence similarity is an important task for protein classification and homology...
The automatic classification of protein sequences into families is of great help for the functional ...
International audienceMOTIVATION: Remote homology detection between protein sequences is a central p...
Biological sequence classification (such as protein remote homology detection) solely based on seque...
Remote homology detection between protein sequences is a central problem in computational biology. D...
The classification of protein sequences using string kernels provides valuable insights for protein ...
The classification of protein sequences using string kernels provides valuable insights for protein ...
Abstract Background The challenge of remote homology detection is that many evolutionarily related s...
The classification of protein sequences using string kernels provides valuable insights for protein ...
The classification of protein sequences using string kernels provides valuable insights for protein ...
The amount of the information being churned out by the field of biology has jumped manifold and now ...
A central problem in computational biology is the classification of related proteins into functional...
The understanding of protein functions and there-by characterization is essential to modeling comple...
Effective representation of the protein sequence is a key issue in detecting remote protein homology...
Motivation Classification of proteins sequences into functional and structural families based on seq...