Analysis of large-scale sequential data has become an important task in machine learning and pattern recognition, inspired in part by numerous scientific and technological applications such as the document and text classification or the analysis of biological sequences. However, current computational methods for sequence comparison still lack accuracy and scalability necessary for reliable analysis of large datasets. To this end, we develop a new framework (efficient algorithms and methods) that solve sequence matching, comparison, classification, and pattern extraction problems in linear time, with increased accuracy, improving over the prior art. In particular, we propose novel ways of modeling sequences under complex transformations (s...
We describe several families of k-mer based string kernels related to the recently presented mismatc...
We model the evolution of biological and linguistic sequences by comparing their statistical propert...
Motivation: Classification of proteins sequences into functional and structural families based on se...
Problems of analysis and modeling of sequential data arise in many practical applications. In this w...
We present a new family of linear time algorithms for string comparison with mismatches under the st...
We present a new family of linear time algorithms for string comparison with mismatches under the st...
We present a new family of linear time algorithms for string comparison with mismatches under the st...
String kernel-based machine learning methods have yielded great success in practical tasks of struct...
Efficient and expressive comparison of sequences is an essential procedure for learning with se-quen...
Analysis of large-scale sequential data has become an important task in machine learning and pattern...
Determining protein sequence similarity is an important task for protein classification and homology...
We model the evolution of biological and linguistic sequences by comparing their statistical propert...
We model the evolution of biological and linguistic sequences by comparing their statistical propert...
Kernel-based approaches for sequence classification have been successfully applied to a variety of d...
Motivation Classification of proteins sequences into functional and structural families based on seq...
We describe several families of k-mer based string kernels related to the recently presented mismatc...
We model the evolution of biological and linguistic sequences by comparing their statistical propert...
Motivation: Classification of proteins sequences into functional and structural families based on se...
Problems of analysis and modeling of sequential data arise in many practical applications. In this w...
We present a new family of linear time algorithms for string comparison with mismatches under the st...
We present a new family of linear time algorithms for string comparison with mismatches under the st...
We present a new family of linear time algorithms for string comparison with mismatches under the st...
String kernel-based machine learning methods have yielded great success in practical tasks of struct...
Efficient and expressive comparison of sequences is an essential procedure for learning with se-quen...
Analysis of large-scale sequential data has become an important task in machine learning and pattern...
Determining protein sequence similarity is an important task for protein classification and homology...
We model the evolution of biological and linguistic sequences by comparing their statistical propert...
We model the evolution of biological and linguistic sequences by comparing their statistical propert...
Kernel-based approaches for sequence classification have been successfully applied to a variety of d...
Motivation Classification of proteins sequences into functional and structural families based on seq...
We describe several families of k-mer based string kernels related to the recently presented mismatc...
We model the evolution of biological and linguistic sequences by comparing their statistical propert...
Motivation: Classification of proteins sequences into functional and structural families based on se...