Computing the similarity between sequences is an important challenge for many data mining tasks. The literature offers a plethora of similarity measures for sequences, most of them designed for sequences of items. In this work, we study the problem of measuring the similarity between sequences of itemsets. We focus on the notion of common subsequences as a way to measure similarity between a pair of sequences, each composed of a list of itemsets. We present new combinatorial results for efficiently counting distinct and common subsequences. These theoretical results are the cornerstone of an effective dynamic programming approach to this problem. In addition, we propose an approximate method to speed up the comp...
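To make the counting problem concrete, here is a minimal sketch in Python. It is not the method of the paper: the first function is the classic textbook dynamic program for counting distinct subsequences of a sequence of single items, and the second is a brute-force enumeration for sequences of itemsets, included only to illustrate what is being counted. The function names are hypothetical, and the sketch assumes the usual definition in which a subsequence is obtained by deleting items from itemsets and dropping any itemset that becomes empty.

```python
from itertools import combinations, product


def count_distinct_subsequences(seq):
    """Classic DP for sequences of items; the empty subsequence is counted."""
    total = 1  # only the empty subsequence so far
    total_before_last = {}  # item -> total just before its previous occurrence
    for item in seq:
        previous_total = total
        # Appending `item` doubles the count, minus the subsequences that
        # were already produced by the previous occurrence of `item`.
        total = 2 * total - total_before_last.get(item, 0)
        total_before_last[item] = previous_total
    return total


def subsets(itemset):
    """All subsets of an itemset, including the empty one."""
    items = sorted(itemset)
    return [frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)]


def distinct_subsequences_of_itemsets(seq):
    """Brute force (exponential): the set of distinct subsequences."""
    result = set()
    for choice in product(*(subsets(x) for x in seq)):
        # An empty subset means the itemset is skipped entirely.
        result.add(tuple(s for s in choice if s))
    return result


if __name__ == "__main__":
    print(count_distinct_subsequences("aba"))
    print(len(distinct_subsequences_of_itemsets([{"a", "b"}, {"a"}])))
```

The brute-force version grows exponentially with the total number of items, which is why an efficient dynamic programming formulation matters for sequences of itemsets.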
Existing dynamic-programming algorithms for identifying similar regions of two sequences require tim...
Emerging sequences (ESs) represent some strong distinguishing knowledge and are very useful...
A subsequence is obtained from a string by deleting any number of characters; thus in contrast to a ...
A growing number of measures of sequence similarity are based on some underlying notion of rela...
Efficient and expressive comparison of sequences is an essential procedure for learning with sequen...
The minimal-length encoding approach is applied to define the concept of sequence similarity. A sequence...
We present a fast algorithm for sequence clustering and searching which works with large sequence da...
This article reviews objections to optimal-matching (OM) algorithms in sequence analysis and reformu...
In this paper we study joint sequence complexity and its applications for find...
While there has been a lot of work on finding frequent itemsets in transaction data streams, none of...
There exist many computational methods for finding similarity in gene sequence, finding suitable met...
We discuss several approaches to similarity preserving coding of symbol sequences and possible conne...