Approximate query processing based on multiple similarity metrics is prevalent and essential for many applications in the database area, such as information retrieval, data mining, pattern matching, and so forth. Almost all existing works adopt the filter-and-verify paradigm. A good filtering method directly affects the final performance and is related to two aspects: small filtering cost and strong pruning power. In this thesis, we mainly designed various powerful filtering algorithms that solve three similarity metrics. The first work is approximate query search with edit distance, which finds all data strings whose edit distances with a given query are no larger than a given threshold. All existing works solve the problem adopting the s...
We discuss using an indexing scheme to accelerate approximate search over a static text in the case ...
Approximate string matching methods are utilized by a vast number of duplicate detection and cluster...
Search engines and recommendation systems are built to efficiently display relevant information from...
Edit distance similarity search, also called approximate pattern matching, is a fundamental problem ...
In this thesis, we study efficient exact query processing algorithms for edit similarity queries and...
This paper proposes new solutions for the approximate dictionary queries problem. These solutions co...
Today, a myriad of data sources, from the Internet to business operations to scientific instruments,...
Metric databases are databases where a metric distance function is defined for pairs of database obj...
In this thesis, we study the Hamming distance query problem. Hamming distance measures the number of...
Similarity search has become one of the important parts of many applications including multimedia re...
AbstractWe review the major paradigms for approximate similarity queries and propose a classificatio...
There is a wide range of applications that require to query a large database of texts to search for ...
Thesis (Ph. D. in Engineering)--University of Tsukuba, (A), no. 6069. 2012.3.23Includes bibliographi...
We review the major paradigms for approximate similarity queries and propose a classification schema...
Many modern applications deal with complex data, where retrieval by similarity plays an important ro...
We discuss using an indexing scheme to accelerate approximate search over a static text in the case ...
Approximate string matching methods are utilized by a vast number of duplicate detection and cluster...
Search engines and recommendation systems are built to efficiently display relevant information from...
Edit distance similarity search, also called approximate pattern matching, is a fundamental problem ...
In this thesis, we study efficient exact query processing algorithms for edit similarity queries and...
This paper proposes new solutions for the approximate dictionary queries problem. These solutions co...
Today, a myriad of data sources, from the Internet to business operations to scientific instruments,...
Metric databases are databases where a metric distance function is defined for pairs of database obj...
In this thesis, we study the Hamming distance query problem. Hamming distance measures the number of...
Similarity search has become one of the important parts of many applications including multimedia re...
AbstractWe review the major paradigms for approximate similarity queries and propose a classificatio...
There is a wide range of applications that require to query a large database of texts to search for ...
Thesis (Ph. D. in Engineering)--University of Tsukuba, (A), no. 6069. 2012.3.23Includes bibliographi...
We review the major paradigms for approximate similarity queries and propose a classification schema...
Many modern applications deal with complex data, where retrieval by similarity plays an important ro...
We discuss using an indexing scheme to accelerate approximate search over a static text in the case ...
Approximate string matching methods are utilized by a vast number of duplicate detection and cluster...
Search engines and recommendation systems are built to efficiently display relevant information from...