We describe a model for strings of characters that is loosely based on the Lempel Ziv model with the addi-tion that a repeated substring can be an approximate match to the original substring; this is close to the situation of DNA, for example. Typically there are many explanations for a given string under the model, some optimal and many suboptimal. Rather than com-mit to one optimal explanation, we sum the probabili-ties over all explanations under the model because this gives the probability of the data under the model. The model has a small number of parameters and these can be estimated from the given string by an expectation-maximization (EM) algorithm. Each iteration of the EM algorithm takes O(n2) time and a few iterations are typica...
Abstract. We present a solution to the problem of performing approx-imate pattern matching on compre...
This work raises the question of approximating the compressibility of a string with respect to a fix...
Given a string x = x[1..n] on an alphabet of size a, and a threshold pmin = 1, we first describe a n...
Abstract. Countless variants of the Lempel-Ziv compression are widely used in many real-life applica...
Abstract. Motivated by the imminent growth of massive, highly redun-dant genomic databases we study ...
We study the approximate string matching and regular expression matching problem for the case when t...
The amount of the data in the world enlarges all the time and therefore efficient methods are needed...
In bio-sequence repositories and other applications, like for instance in the production of a Cd-rom...
We present a simple algorithm which for an explicitly given input string pat (a pattern) and a stand...
AbstractWe present a simple algorithm which for an explicitly given input string pat (a pattern) and...
Matching a biological sequence against a probabilistic pattern (or profile) is a common task in comp...
Abstract. Given a string x = x[1..n] on an alphabet of size α, and a threshold pmin ≥ 1, we first de...
Combinatorics on words began more than a century ago with a demonstration that an infinitely long st...
1 Introduction and Related Work Approximate string matching (ASM) is an important problem that arise...
Rapid advancements in research in the field of DNA sequence discovery has led to a vast range of com...
Abstract. We present a solution to the problem of performing approx-imate pattern matching on compre...
This work raises the question of approximating the compressibility of a string with respect to a fix...
Given a string x = x[1..n] on an alphabet of size a, and a threshold pmin = 1, we first describe a n...
Abstract. Countless variants of the Lempel-Ziv compression are widely used in many real-life applica...
Abstract. Motivated by the imminent growth of massive, highly redun-dant genomic databases we study ...
We study the approximate string matching and regular expression matching problem for the case when t...
The amount of the data in the world enlarges all the time and therefore efficient methods are needed...
In bio-sequence repositories and other applications, like for instance in the production of a Cd-rom...
We present a simple algorithm which for an explicitly given input string pat (a pattern) and a stand...
AbstractWe present a simple algorithm which for an explicitly given input string pat (a pattern) and...
Matching a biological sequence against a probabilistic pattern (or profile) is a common task in comp...
Abstract. Given a string x = x[1..n] on an alphabet of size α, and a threshold pmin ≥ 1, we first de...
Combinatorics on words began more than a century ago with a demonstration that an infinitely long st...
1 Introduction and Related Work Approximate string matching (ASM) is an important problem that arise...
Rapid advancements in research in the field of DNA sequence discovery has led to a vast range of com...
Abstract. We present a solution to the problem of performing approx-imate pattern matching on compre...
This work raises the question of approximating the compressibility of a string with respect to a fix...
Given a string x = x[1..n] on an alphabet of size a, and a threshold pmin = 1, we first describe a n...