We present in this paper three algorithms. The first extracts repeated motifs from a weighted sequence. The motifs correspond to words which occur at least q times and with hamming distance e in a weighted sequence with probability 1/k each time, where k is a small constant. The second algorithm extracts common motifs from a set of N 2 weighted sequences with hamming distance e. In the second case, the motifs must occur twice with probability 1/k, in 1 distinct sequences of the set. The third algorithm extracts maximal pairs from a weighted sequence. A pair in a sequence is the occurrence of the same substring twice. In addition, the algorithms presented in this paper improve slightly on previous work on these problem...
Abstract. Many algorithms for motif finding that are commonly used in bioinformatics start by sampli...
Abstract—Most of existing sequence mining algorithms focuses on mining for subsequences. A large cla...
Finding motifs in biological sequences is one of the most intriguing problems for string algorithm d...
In this paper, a natural probabilistic model for motif discovery has been used to experimentally tes...
Strings, sequences and, ultimately, motifs play a key role in bioinformatics. Here, a motif is a bio...
Motif inference is at the heart of several time-demanding computational tasks, such as in molecul...
OBJECTIVE: The human genome project has resulted in the generation of voluminous biological data. No...
Motivation: Motif identification for sequences has many important applications in biological studies...
Motif inference is at the heart of several time-demanding computational tasks, such as in molecul...
I hereby declare that I am the sole author of this thesis. This is a true copy of the thesis, includ...
The problem of multiple global comparison in families of biological sequences has been well-studied....
Motif inference represents one of the most important areas of research in computational biology, and...
In this paper we present an algorithm for the problem of planted structured motif extraction from a ...
Using our techniques for extracting approximate non-tandem repeats[1] on well constructed maximal mo...
In many time series data mining problems, the analysis can be reduced to frequent pattern mining. Sp...
Abstract. Many algorithms for motif finding that are commonly used in bioinformatics start by sampli...
Abstract—Most of existing sequence mining algorithms focuses on mining for subsequences. A large cla...
Finding motifs in biological sequences is one of the most intriguing problems for string algorithm d...
In this paper, a natural probabilistic model for motif discovery has been used to experimentally tes...
Strings, sequences and, ultimately, motifs play a key role in bioinformatics. Here, a motif is a bio...
Motif inference is at the heart of several time-demanding computational tasks, such as in molecul...
OBJECTIVE: The human genome project has resulted in the generation of voluminous biological data. No...
Motivation: Motif identification for sequences has many important applications in biological studies...
Motif inference is at the heart of several time-demanding computational tasks, such as in molecul...
I hereby declare that I am the sole author of this thesis. This is a true copy of the thesis, includ...
The problem of multiple global comparison in families of biological sequences has been well-studied....
Motif inference represents one of the most important areas of research in computational biology, and...
In this paper we present an algorithm for the problem of planted structured motif extraction from a ...
Using our techniques for extracting approximate non-tandem repeats[1] on well constructed maximal mo...
In many time series data mining problems, the analysis can be reduced to frequent pattern mining. Sp...
Abstract. Many algorithms for motif finding that are commonly used in bioinformatics start by sampli...
Abstract—Most of existing sequence mining algorithms focuses on mining for subsequences. A large cla...
Finding motifs in biological sequences is one of the most intriguing problems for string algorithm d...