Top-k approximate querying on string collections is an important data analysis tool for many applications, and it has been exhaustively studied. However, the scale of the problem has increased dramatically because of the prevalence of the Web. In this paper, we aim to explore the efficient top-k similar string matching problem. Several efficient strategies are introduced, such as length aware and adaptive q-gram selection. We present a general q-gram based framework and propose two efficient algorithms based on the strategies introduced. Our techniques are experimentally evaluated on three real data sets and show a superior performance
AbstractAn approximate string-matching algorithm is described based on earlier attribute-matching al...
We give two algorithms for finding all approximate matches of a pattern in a text, where the edit di...
We give two algorithms for finding all approximate matches of a pattern in a text, where the edit di...
There is a wide range of applications that require to query a large database of texts to search for ...
AbstractWe study approximate string-matching in connection with two string distance functions that a...
We present new algorithms for approximate string matching based in simple, but efficient, ideas. Fir...
We improve the fastest known algorithm for approximate string matching. This algorithm can only be u...
Approximate string matching is a basic and important concept in many applications of information ret...
AbstractWe present a new index for approximate string matching. The index collects text q-samples, t...
Given a text T of length n and a pattern P of length m, the string matching problem is a task to fin...
Given a collection of strings, goal of the approximate string matching is to efficiently find the st...
A popular and well-studied class of filters for approximate string matching compares substrings of l...
We survey the current techniques to cope with the problem of string matching that allows errors. Thi...
Many database applications require similarity based retrieval on stored text and/or multimedia objec...
We present a new algorithm for multiple approximate string matching. It is based on reading backward...
AbstractAn approximate string-matching algorithm is described based on earlier attribute-matching al...
We give two algorithms for finding all approximate matches of a pattern in a text, where the edit di...
We give two algorithms for finding all approximate matches of a pattern in a text, where the edit di...
There is a wide range of applications that require to query a large database of texts to search for ...
AbstractWe study approximate string-matching in connection with two string distance functions that a...
We present new algorithms for approximate string matching based in simple, but efficient, ideas. Fir...
We improve the fastest known algorithm for approximate string matching. This algorithm can only be u...
Approximate string matching is a basic and important concept in many applications of information ret...
AbstractWe present a new index for approximate string matching. The index collects text q-samples, t...
Given a text T of length n and a pattern P of length m, the string matching problem is a task to fin...
Given a collection of strings, goal of the approximate string matching is to efficiently find the st...
A popular and well-studied class of filters for approximate string matching compares substrings of l...
We survey the current techniques to cope with the problem of string matching that allows errors. Thi...
Many database applications require similarity based retrieval on stored text and/or multimedia objec...
We present a new algorithm for multiple approximate string matching. It is based on reading backward...
AbstractAn approximate string-matching algorithm is described based on earlier attribute-matching al...
We give two algorithms for finding all approximate matches of a pattern in a text, where the edit di...
We give two algorithms for finding all approximate matches of a pattern in a text, where the edit di...