Approximate dictionary matching (checking if a pattern occurs in a collection of strings) is a classic problem with applications in e.g. spellchecking, online catalogs, and web searchers. We present a simple solution called split index, which is based on the Dirichlet principle, for matching a keyword with few mismatches, and experimentally show that it offers competitive space-time tradeoffs. Our implementation in the C++ language is focused mostly on data compaction, which is beneficial for the search speed. We compare our solution with other algorithms and we show that it is faster when the Hamming distance is used. Query times in the order of 1 microsecond were reported for one mismatch for a few-megabyte natural language dictionary on ...
AbstractWe present a new index for approximate string matching. The index collects text q-samples, t...
We revisit the problem of indexing a string S[1..n] to support finding all substrings in S that matc...
International audienceIn this paper we study lower bounds for the fundamental problem of text indexi...
AbstractLet T be a text of length n and P be a pattern of length m, both strings over a fixed finite...
We revisit the fundamental problem of dictionary look-up with mismatches. Given a set (dictionary) o...
DOI: 10.1109/DCC.2011.18Given a set D of d patterns of total length n, the dictionary matching probl...
Let T be a text of length n and P be a pattern of length m, both strings over a fixed finite alphabe...
Approximate string matching is about finding a given string pattern in a text by allowing some degre...
In the k-mismatch problem we are given a pattern of length m and a text and must find all locations ...
In this paper we present different solutions for the problem of indexing a dictionary of strings in ...
International audienceWe study strategies of approximate pattern matching that exploit bidirectional...
AbstractWe present an index that stores a text of length n such that given a pattern of length m, al...
This paper revisits the problem of indexing a text S[1.,n] to support searching substrings in S that...
This paper deals with the approximate string-matching problem with Hamming distance. The approximate...
The objective of the research in this dissertation is to derive optimal search schemes for approxima...
AbstractWe present a new index for approximate string matching. The index collects text q-samples, t...
We revisit the problem of indexing a string S[1..n] to support finding all substrings in S that matc...
International audienceIn this paper we study lower bounds for the fundamental problem of text indexi...
AbstractLet T be a text of length n and P be a pattern of length m, both strings over a fixed finite...
We revisit the fundamental problem of dictionary look-up with mismatches. Given a set (dictionary) o...
DOI: 10.1109/DCC.2011.18Given a set D of d patterns of total length n, the dictionary matching probl...
Let T be a text of length n and P be a pattern of length m, both strings over a fixed finite alphabe...
Approximate string matching is about finding a given string pattern in a text by allowing some degre...
In the k-mismatch problem we are given a pattern of length m and a text and must find all locations ...
In this paper we present different solutions for the problem of indexing a dictionary of strings in ...
International audienceWe study strategies of approximate pattern matching that exploit bidirectional...
AbstractWe present an index that stores a text of length n such that given a pattern of length m, al...
This paper revisits the problem of indexing a text S[1.,n] to support searching substrings in S that...
This paper deals with the approximate string-matching problem with Hamming distance. The approximate...
The objective of the research in this dissertation is to derive optimal search schemes for approxima...
AbstractWe present a new index for approximate string matching. The index collects text q-samples, t...
We revisit the problem of indexing a string S[1..n] to support finding all substrings in S that matc...
International audienceIn this paper we study lower bounds for the fundamental problem of text indexi...