International audienceIn this paper we study lower bounds for the fundamental problem of text indexing with mis-matches and differences. In this problem we are given a long string of length n, the "text", and the task is to preprocess it into a data structure such that given a query string Q, one can quickly identify substrings that are within Hamming or edit distance at most k from Q. This problem is at the core of various problems arising in biology and text processing. While exact text indexing allows linear-size data structures with linear query time, text indexing with k mismatches (or k differences) seems to be much harder: All known data structures have exponential dependency on k either in the space, or in the time bound. We provide...
[[abstract]]In this paper, we study the following three variants of the classical text indexing prob...
We consider the problem of indexing a text T (of length n) with a light data structure that supports...
In many real-world database systems, a large fraction of the data is represented by strings: Sequenc...
International audienceIn this paper we study lower bounds for the fundamental problem of text indexi...
AbstractLet T be a text of length n and P be a pattern of length m, both strings over a fixed finite...
AbstractWe present an index that stores a text of length n such that given a pattern of length m, al...
We study data structure problems related to document indexing and pattern matching queries and our m...
Let T be a text of length n and P be a pattern of length m, both strings over a fixed finite alphabe...
AbstractIn this paper we address the problem of constructing an index for a text document or a colle...
Approximate string matching is about finding a given string pattern in a text by allowing some degre...
Approximate dictionary matching (checking if a pattern occurs in a collection of strings) is a class...
Abstract. This paper revisits the problem of indexing a text S[1..n] to support searching substrings...
AbstractWe present a radically new indexing approach for approximate string matching. The scheme use...
AbstractWe present a new index for approximate string matching. The index collects text q-samples, t...
AbstractThis paper revisits the problem of indexing a text S[1..n] for pattern matching with up to k...
[[abstract]]In this paper, we study the following three variants of the classical text indexing prob...
We consider the problem of indexing a text T (of length n) with a light data structure that supports...
In many real-world database systems, a large fraction of the data is represented by strings: Sequenc...
International audienceIn this paper we study lower bounds for the fundamental problem of text indexi...
AbstractLet T be a text of length n and P be a pattern of length m, both strings over a fixed finite...
AbstractWe present an index that stores a text of length n such that given a pattern of length m, al...
We study data structure problems related to document indexing and pattern matching queries and our m...
Let T be a text of length n and P be a pattern of length m, both strings over a fixed finite alphabe...
AbstractIn this paper we address the problem of constructing an index for a text document or a colle...
Approximate string matching is about finding a given string pattern in a text by allowing some degre...
Approximate dictionary matching (checking if a pattern occurs in a collection of strings) is a class...
Abstract. This paper revisits the problem of indexing a text S[1..n] to support searching substrings...
AbstractWe present a radically new indexing approach for approximate string matching. The scheme use...
AbstractWe present a new index for approximate string matching. The index collects text q-samples, t...
AbstractThis paper revisits the problem of indexing a text S[1..n] for pattern matching with up to k...
[[abstract]]In this paper, we study the following three variants of the classical text indexing prob...
We consider the problem of indexing a text T (of length n) with a light data structure that supports...
In many real-world database systems, a large fraction of the data is represented by strings: Sequenc...