This paper introduces the sequence covering similarity, that we formally define for evaluating the similarity between a symbolic sequence (string) and a set of symbolic sequences (strings). From this covering similarity we derive a pair-wise distance to compare two symbolic sequences. We show that this covering distance is a semimetric. Few examples are given to show how this string metric in $O(n \cdot log n)$ compares with the Levenshtein's distance that is in $O(n^2)$. A final example presents its application to plagiarism detection
We give an efficient protocol for sequence comparisons of the edit-distance kind, such that neither ...
Similarity search is a fundamental problem in information technology. The main difficulty of this pr...
AbstractThe problem of computing the similarity of two run-length encoded strings has been studied f...
This paper introduces the sequence covering similarity, that we formally define for evaluating the s...
Efficient and expressive comparison of sequences is an essential procedure for learning with se-quen...
We discuss several approaches to similarity preserving coding of symbol sequences and possible conne...
International audienceThis paper introduces a new similarity measure, the covering similarity, that ...
A growing number of measures of sequence similarity is being based on some underlying notion of rela...
String kernel-based machine learning methods have yielded great success in practical tasks of struct...
Computing the similarity between sequences is a very important challenge for many different data min...
irements. A simple and computationally very effective "distance" measure for sequences is ...
International audienceIn this paper we consider structural comparison of sequences, that is, to comp...
Nowadays sequences of symbols are becoming more important, as they are the standard format for repre...
Comparison functions for sequences (of symbols) are important components of many applications, for e...
This article reviews objections to optimal-matching (OM) algorithms in sequence analysis and reformu...
We give an efficient protocol for sequence comparisons of the edit-distance kind, such that neither ...
Similarity search is a fundamental problem in information technology. The main difficulty of this pr...
AbstractThe problem of computing the similarity of two run-length encoded strings has been studied f...
This paper introduces the sequence covering similarity, that we formally define for evaluating the s...
Efficient and expressive comparison of sequences is an essential procedure for learning with se-quen...
We discuss several approaches to similarity preserving coding of symbol sequences and possible conne...
International audienceThis paper introduces a new similarity measure, the covering similarity, that ...
A growing number of measures of sequence similarity is being based on some underlying notion of rela...
String kernel-based machine learning methods have yielded great success in practical tasks of struct...
Computing the similarity between sequences is a very important challenge for many different data min...
irements. A simple and computationally very effective "distance" measure for sequences is ...
International audienceIn this paper we consider structural comparison of sequences, that is, to comp...
Nowadays sequences of symbols are becoming more important, as they are the standard format for repre...
Comparison functions for sequences (of symbols) are important components of many applications, for e...
This article reviews objections to optimal-matching (OM) algorithms in sequence analysis and reformu...
We give an efficient protocol for sequence comparisons of the edit-distance kind, such that neither ...
Similarity search is a fundamental problem in information technology. The main difficulty of this pr...
AbstractThe problem of computing the similarity of two run-length encoded strings has been studied f...