String data is ubiquitous, and its management has taken on particular importance in the past few years. Approximate queries are very important on string data. This is due, for example, to the prevalence of typographical errors in data, and multiple conventions for recording attributes such as name and address. Commercial databases do not support approximate string queries directly, and it is a challenge to implement this functionality efficiently with user-defined functions (UDFs). In this paper, we develop a technique for building approximate string processing capabilities on top of commercial databases by exploiting facilities already available in them. At the core, our technique relies on generating short substrings of length q, called q...
In this thesis, we study efficient exact query processing algorithms for edit similarity queries and...
Abstract Background The problem of approximate string matching is important in many different areas ...
A string similarity measure quantifies the similarity between two text strings for approximate strin...
String data is ubiquitous, and its management has taken on particular importance in the past few yea...
There is a wide range of applications that require to query a large database of texts to search for ...
AbstractWe study approximate string-matching in connection with two string distance functions that a...
Given a collection of strings, goal of the approximate string matching is to efficiently find the st...
Approximate queries on string data are important, due to the prevalence of such data in databases an...
Many database applications require similarity based retrieval on stored text and/or multimedia objec...
AbstractWe present a new index for approximate string matching. The index collects text q-samples, t...
Entity extraction (also known as entity recognition) extracts entities (e.g., person names, location...
Top-k approximate querying on string collections is an important data analysis tool for many applica...
145 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1993.For some applications, it may...
With the widespread use of the internet, text-based data sources have become ubiquitous and the dema...
We survey the current techniques to cope with the problem of string matching that allows errors. Thi...
In this thesis, we study efficient exact query processing algorithms for edit similarity queries and...
Abstract Background The problem of approximate string matching is important in many different areas ...
A string similarity measure quantifies the similarity between two text strings for approximate strin...
String data is ubiquitous, and its management has taken on particular importance in the past few yea...
There is a wide range of applications that require to query a large database of texts to search for ...
AbstractWe study approximate string-matching in connection with two string distance functions that a...
Given a collection of strings, goal of the approximate string matching is to efficiently find the st...
Approximate queries on string data are important, due to the prevalence of such data in databases an...
Many database applications require similarity based retrieval on stored text and/or multimedia objec...
AbstractWe present a new index for approximate string matching. The index collects text q-samples, t...
Entity extraction (also known as entity recognition) extracts entities (e.g., person names, location...
Top-k approximate querying on string collections is an important data analysis tool for many applica...
145 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1993.For some applications, it may...
With the widespread use of the internet, text-based data sources have become ubiquitous and the dema...
We survey the current techniques to cope with the problem of string matching that allows errors. Thi...
In this thesis, we study efficient exact query processing algorithms for edit similarity queries and...
Abstract Background The problem of approximate string matching is important in many different areas ...
A string similarity measure quantifies the similarity between two text strings for approximate strin...