We propose a learning algorithm that discovers a motif represented by patterns and an alphabet indexing from biosequences. From only positive examples with the help of an alphabet indexing, the algorithm nds k regular patterns as a k-minimal multiple generalization (k-mmg for short). The computational results for transmembrane domains indicate that the combination of k-mmg and alphabet indexing works quite successful. We also introduce a partial alphabet indexing that transforms symbols dependently on the position in sequences.
AbstractWe introduce a new notion of motifs, called masks, that succinctly represents the repeated p...
This paper is a survey of approaches and algorithms used for the automatic discovery of patterns in ...
The problem of multiple global comparison in families of biological sequences has been well-studied....
A regular pattern is a string consisting of constant symbols and mutually distinct variables, and re...
Recently, several attempts have been made at applying machine learning method to protein motif disco...
The k-minimal multiple generalization (k-mmg) is a natural extension of the least generalization (lg...
In this chapter, we present a polynomial time algorithm, called a k-minimal multiple generalization...
A pattern is a string of constant symbols and variables. The language defined by a pattern p is the...
To appear in Machine Intelligence, 13In this paper, we describe a polynomial time algorithm, called ...
In this paper, we introduce a new notion of motifs, called \emph{masks}, that succinctly represen...
We introduce a new notion of motifs, called masks, that succinctly represents the repeated patterns ...
This paper surveys approaches to the discovery of patterns in biosequences and places these approach...
For two finite disjoint sets $P$ and $Q$ of strings over an alphabet $sum$, an alphabet indexing $p...
Given an input sequence of data, a motif is a repeating pattern, possibly interspersed with `dont ca...
this paper is as follows. This introduction is followed in Section 2 by a brief introduction to some...
AbstractWe introduce a new notion of motifs, called masks, that succinctly represents the repeated p...
This paper is a survey of approaches and algorithms used for the automatic discovery of patterns in ...
The problem of multiple global comparison in families of biological sequences has been well-studied....
A regular pattern is a string consisting of constant symbols and mutually distinct variables, and re...
Recently, several attempts have been made at applying machine learning method to protein motif disco...
The k-minimal multiple generalization (k-mmg) is a natural extension of the least generalization (lg...
In this chapter, we present a polynomial time algorithm, called a k-minimal multiple generalization...
A pattern is a string of constant symbols and variables. The language defined by a pattern p is the...
To appear in Machine Intelligence, 13In this paper, we describe a polynomial time algorithm, called ...
In this paper, we introduce a new notion of motifs, called \emph{masks}, that succinctly represen...
We introduce a new notion of motifs, called masks, that succinctly represents the repeated patterns ...
This paper surveys approaches to the discovery of patterns in biosequences and places these approach...
For two finite disjoint sets $P$ and $Q$ of strings over an alphabet $sum$, an alphabet indexing $p...
Given an input sequence of data, a motif is a repeating pattern, possibly interspersed with `dont ca...
this paper is as follows. This introduction is followed in Section 2 by a brief introduction to some...
AbstractWe introduce a new notion of motifs, called masks, that succinctly represents the repeated p...
This paper is a survey of approaches and algorithms used for the automatic discovery of patterns in ...
The problem of multiple global comparison in families of biological sequences has been well-studied....