Edit distance is the most widely used method to quantify similarity between two strings. We investigate the problem of similarity search under edit distance. Given a collection of sequences, the goal of similarity search under edit distance is to find sequences in the collection that are similar to a given query sequence where the similarity score is computed using edit distance. The canonical method of computing edit distance between two strings uses a dynamic programming-based approach that runs in quadratic time and space, which may not provide results in a reasonable amount of time for large sequences. It advocates for parallel algorithms to reduce the time taken by edit distance computation. To this end, we present scalable parallel al...
Tree-structured data are becoming ubiquitous nowadays and manipulating them based on similarity is e...
Metric databases are databases where a metric distance function is defined for pairs of database obj...
Edit distance measures the similarity between two strings (as the minimum number of change, insert o...
In this thesis, we study efficient exact query processing algorithms for edit similarity queries and...
Edit distance similarity search, also called approximate pattern matching, is a fundamental problem ...
The edit distance is the most famous distance to compute the similarity between two strings of chara...
Abstract—Edit distance is widely used for measuring the similarity between two strings. As a primiti...
Fast similarity search is important for time-sensitive applications. Those include both enterprise a...
Similarity search is important for many data-intensive applications to identify a set of similar obj...
In this paper, we study efficient parallel edit distance algorithms, both in theory and in practice....
The P-Bigram method is a string comparison methods base on an internal two characters-based similari...
In this paper, we consider the problem of efficient matching and retrieval of sequences of different...
The similarity search consists on retrieving all objects within a database that are similar or relev...
We give an efficient protocol for sequence comparisons of the edit-distance kind, such that neither ...
Abstract Background The problem of approximate string matching is important in many different areas ...
Tree-structured data are becoming ubiquitous nowadays and manipulating them based on similarity is e...
Metric databases are databases where a metric distance function is defined for pairs of database obj...
Edit distance measures the similarity between two strings (as the minimum number of change, insert o...
In this thesis, we study efficient exact query processing algorithms for edit similarity queries and...
Edit distance similarity search, also called approximate pattern matching, is a fundamental problem ...
The edit distance is the most famous distance to compute the similarity between two strings of chara...
Abstract—Edit distance is widely used for measuring the similarity between two strings. As a primiti...
Fast similarity search is important for time-sensitive applications. Those include both enterprise a...
Similarity search is important for many data-intensive applications to identify a set of similar obj...
In this paper, we study efficient parallel edit distance algorithms, both in theory and in practice....
The P-Bigram method is a string comparison methods base on an internal two characters-based similari...
In this paper, we consider the problem of efficient matching and retrieval of sequences of different...
The similarity search consists on retrieving all objects within a database that are similar or relev...
We give an efficient protocol for sequence comparisons of the edit-distance kind, such that neither ...
Abstract Background The problem of approximate string matching is important in many different areas ...
Tree-structured data are becoming ubiquitous nowadays and manipulating them based on similarity is e...
Metric databases are databases where a metric distance function is defined for pairs of database obj...
Edit distance measures the similarity between two strings (as the minimum number of change, insert o...