A growing number of measures of sequence similarity is being based on some underlying notion of relative compressibility. Within this paradigm, similar sequences are expected to share a large number of common substrings, or subsequences, or more complex patterns or motifs, and so on. The computational complexity of such measures varies, and it increases with the complexion of the patterns taken into account. At the low end of the spectrum, most measures based on the bags of shared substrings are typically afforded in linear time. This performance is no longer achievable as soon as some degree of distortion is accepted. In this paper, measures of sequence similarity are introduced and studied in which patterns in a pair are considered simila...
Efficient and expressive comparison of sequences is an essential procedure for learning with se-quen...
Local alignment-free sequence comparison arises in the context of identifying similar seg-ments of s...
Abstract The increasing throughput of sequencing raises growing needs for methods of sequence analys...
We study the problem of similarity detection by sequence alignment with gaps, using a recently estab...
One way to represent a DNA sequence is to break it down into substrings of length L, called L-tuples...
Computing the similarity between sequences is a very important challenge for many different data min...
We study the problem of similarity detection by sequence alignment with gaps, using a recently estab...
Sequence comparison is a fundamental task in computational biology, traditionally dominated by align...
This article reviews objections to optimal-matching (OM) algorithms in sequence analysis and reformu...
The statistical properties of local alignment algorithms with gaps are analyzed theoretically for uu...
AbstractAlgorithms for finding similar, or highly conserved, regions in a group of sequences are at ...
International audienceGiven two strings of size n over a constant alphabet, the classical algorithm ...
Algorithms for finding similar, or highly conserved, regions in a group of sequences are at the core...
We discuss several approaches to similarity preserving coding of symbol sequences and possible conne...
Abstract. Given two strings ofsize n over a constant alphabet, the classical algorithm for computing...
Efficient and expressive comparison of sequences is an essential procedure for learning with se-quen...
Local alignment-free sequence comparison arises in the context of identifying similar seg-ments of s...
Abstract The increasing throughput of sequencing raises growing needs for methods of sequence analys...
We study the problem of similarity detection by sequence alignment with gaps, using a recently estab...
One way to represent a DNA sequence is to break it down into substrings of length L, called L-tuples...
Computing the similarity between sequences is a very important challenge for many different data min...
We study the problem of similarity detection by sequence alignment with gaps, using a recently estab...
Sequence comparison is a fundamental task in computational biology, traditionally dominated by align...
This article reviews objections to optimal-matching (OM) algorithms in sequence analysis and reformu...
The statistical properties of local alignment algorithms with gaps are analyzed theoretically for uu...
AbstractAlgorithms for finding similar, or highly conserved, regions in a group of sequences are at ...
International audienceGiven two strings of size n over a constant alphabet, the classical algorithm ...
Algorithms for finding similar, or highly conserved, regions in a group of sequences are at the core...
We discuss several approaches to similarity preserving coding of symbol sequences and possible conne...
Abstract. Given two strings ofsize n over a constant alphabet, the classical algorithm for computing...
Efficient and expressive comparison of sequences is an essential procedure for learning with se-quen...
Local alignment-free sequence comparison arises in the context of identifying similar seg-ments of s...
Abstract The increasing throughput of sequencing raises growing needs for methods of sequence analys...