This paper proposes new solutions for the approximate dictionary queries problem. These solutions combine the use of succinct data structures with an efficient representation of the keys to significantly reduce the space usage of the state-of-the-art solutions without introducing any time penalty. Finally, by exploiting triangle inequality, we can also significantly speed up the query time of the existing solutions
We focus on the efficient search for the most similar bit strings to a given query in the Hamming sp...
To speed up similarity based searches many indexing techniques have been proposed in order to addres...
A common problem in many types of databases is retrieving the most similar matches to a query object...
Searching for similar objects in a collection is a core task of many applications in databases, patt...
In this thesis, we study the Hamming distance query problem. Hamming distance measures the number of...
Abstract—We consider the problem of finding similar patterns in a time sequence. Typical application...
Approximate query processing based on multiple similarity metrics is prevalent and essential for man...
Today, a myriad of data sources, from the Internet to business operations to scientific instruments,...
Edit distance similarity search, also called approximate pattern matching, is a fundamental problem ...
Approximate dictionary matching (checking if a pattern occurs in a collection of strings) is a class...
This thesis addresses the problem of indexing and querying very large databases of binary vectors. S...
AbstractWe present an index that stores a text of length n such that given a pattern of length m, al...
Nearest neighbor searching of large databases in high-dimensional spaces is inherently difficult due...
In this paper we present an indexing method for probably approximately correct nearest neighbor quer...
This thesis elaborates on the problem of preprocessing a large graph so that single-pair shortest-pa...
We focus on the efficient search for the most similar bit strings to a given query in the Hamming sp...
To speed up similarity based searches many indexing techniques have been proposed in order to addres...
A common problem in many types of databases is retrieving the most similar matches to a query object...
Searching for similar objects in a collection is a core task of many applications in databases, patt...
In this thesis, we study the Hamming distance query problem. Hamming distance measures the number of...
Abstract—We consider the problem of finding similar patterns in a time sequence. Typical application...
Approximate query processing based on multiple similarity metrics is prevalent and essential for man...
Today, a myriad of data sources, from the Internet to business operations to scientific instruments,...
Edit distance similarity search, also called approximate pattern matching, is a fundamental problem ...
Approximate dictionary matching (checking if a pattern occurs in a collection of strings) is a class...
This thesis addresses the problem of indexing and querying very large databases of binary vectors. S...
AbstractWe present an index that stores a text of length n such that given a pattern of length m, al...
Nearest neighbor searching of large databases in high-dimensional spaces is inherently difficult due...
In this paper we present an indexing method for probably approximately correct nearest neighbor quer...
This thesis elaborates on the problem of preprocessing a large graph so that single-pair shortest-pa...
We focus on the efficient search for the most similar bit strings to a given query in the Hamming sp...
To speed up similarity based searches many indexing techniques have been proposed in order to addres...
A common problem in many types of databases is retrieving the most similar matches to a query object...