Background: In protein sequence classification, identification of the sequence motifs or n-grams that can precisely discriminate between classes is a more interesting scientific question than the classification itself. A number of classification methods aim at accurate classification but fail to explain which sequence features indeed contribute to the accuracy. We hypothesize that sequences in lower denominations (n-grams) can be used to explore the sequence landscape and to identify class-specific motifs that discriminate between classes during classification. Discriminative n-grams are short peptide sequences that are highly frequent in one class but are either minimally present or absent in other classes. In this study, we present a new ...
We use methods from data mining and knowledge discovery to design an algorithm for detecting motifs ...
Modern sequencing initiatives have uncovered a large number of protein sequence data. The exponentia...
www.cs.iastate.edu/~honavar/aigroup.html This paper describes an approach to data-driven discovery o...
Background: In protein sequence classification, identification of the sequence motifs or n-grams tha...
Discrete motifs that discriminate functional classes of proteins are useful for classifying new sequ...
We use methods from Data Mining and Knowledge Discovery to design an algorithm for detecting motifs ...
Abstract Background This paper deals with the preprocessing of protein sequences for supervised clas...
We describe a method for discovering active motifs in a set of related protein sequences. The method...
Most existing methods for sequence-based classification use exhaustive feature generation, employing...
Summary. Protein function prediction, i.e. classification of protein sequences according to their bi...
Proteins sharing a certain biological role often contain short sequences, or motifs, that are conser...
Predicting the function of a protein from its sequence is typically addressed using sequence-similar...
Linear motifs are short protein subsequences that mediate protein interactions. Hundreds of motif cl...
Several computer algorithms now exist for discovering multiple motifs (expressed as weight matrices)...
MOTIVATION: Identification of conserved motifs in biological sequences is crucial to unveil common s...
We use methods from data mining and knowledge discovery to design an algorithm for detecting motifs ...
Modern sequencing initiatives have uncovered a large number of protein sequence data. The exponentia...
www.cs.iastate.edu/~honavar/aigroup.html This paper describes an approach to data-driven discovery o...
Background: In protein sequence classification, identification of the sequence motifs or n-grams tha...
Discrete motifs that discriminate functional classes of proteins are useful for classifying new sequ...
We use methods from Data Mining and Knowledge Discovery to design an algorithm for detecting motifs ...
Abstract Background This paper deals with the preprocessing of protein sequences for supervised clas...
We describe a method for discovering active motifs in a set of related protein sequences. The method...
Most existing methods for sequence-based classification use exhaustive feature generation, employing...
Summary. Protein function prediction, i.e. classification of protein sequences according to their bi...
Proteins sharing a certain biological role often contain short sequences, or motifs, that are conser...
Predicting the function of a protein from its sequence is typically addressed using sequence-similar...
Linear motifs are short protein subsequences that mediate protein interactions. Hundreds of motif cl...
Several computer algorithms now exist for discovering multiple motifs (expressed as weight matrices)...
MOTIVATION: Identification of conserved motifs in biological sequences is crucial to unveil common s...
We use methods from data mining and knowledge discovery to design an algorithm for detecting motifs ...
Modern sequencing initiatives have uncovered a large number of protein sequence data. The exponentia...
www.cs.iastate.edu/~honavar/aigroup.html This paper describes an approach to data-driven discovery o...