Frequent sequence mining is one of the fundamental building blocks in data mining. While the problem has been extensively studied, few of the available techniques are sufficiently scalable to handle datasets with billions of sequences; such large-scale datasets arise, for instance, in text mining and session analysis. In this article, we propose MG-FSM, a scalable algorithm for frequent sequence mining on MapReduce. MG-FSM can handle so-called “gap constraints”, which can be used to limit the output to a controlled set of frequent sequences. Both positional and temporal gap constraints, as well as appropriate maximality and closedness constraints, are supported. At its heart, MG-FSM partitions the input database in a way that allows us to m...
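To make the notion of a positional gap constraint concrete, here is a minimal sketch. It is not MG-FSM itself and involves no MapReduce partitioning. It assumes that a maximum gap of `max_gap` means at most `max_gap` input items may lie between two consecutive matched items. The names `occurs_with_max_gap` and `support` are hypothetical, chosen only for illustration.

```python
from typing import Hashable, Sequence


def occurs_with_max_gap(pattern: Sequence[Hashable],
                        sequence: Sequence[Hashable],
                        max_gap: int) -> bool:
    """Return True if `pattern` occurs in `sequence` as a subsequence with at
    most `max_gap` items between consecutive matched items (assumed semantics)."""
    if not pattern:
        return True

    def match_from(p_idx: int, last_pos: int) -> bool:
        # All pattern items matched: the occurrence is complete.
        if p_idx == len(pattern):
            return True
        # The next item must lie within max_gap positions after last_pos.
        hi = min(len(sequence), last_pos + 2 + max_gap)
        for pos in range(last_pos + 1, hi):
            if sequence[pos] == pattern[p_idx] and match_from(p_idx + 1, pos):
                return True
        return False

    # Try every position at which the first pattern item could be matched.
    return any(sequence[start] == pattern[0] and match_from(1, start)
               for start in range(len(sequence)))


def support(pattern: Sequence[Hashable],
            database: Sequence[Sequence[Hashable]],
            max_gap: int) -> int:
    """Count the input sequences that contain `pattern` under the gap constraint."""
    return sum(occurs_with_max_gap(pattern, seq, max_gap) for seq in database)


if __name__ == "__main__":
    db = [list("abcd"), list("axxb"), list("ab")]
    print(support(list("ab"), db, max_gap=0))
    print(support(list("ab"), db, max_gap=2))
```

A pattern would count as frequent when its support reaches a user-chosen threshold. Tightening `max_gap` shrinks the set of qualifying patterns, which is how a gap constraint limits the output.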
We study two problems: (1) mining frequent sequences from a transactional database, and (2) incremen...
There is a huge wealth of sequence data available, for example, customer purchase histori...
Frequent sequence mining methods often make use of constraints to control which subsequences should ...
Frequent sequence mining is one of the fundamental building blocks in data mining. While the problem...
Frequent sequence mining is one of the fundamental building blocks in data mining. While the problem...
In this thesis, we study scalable and general purpose methods for mining frequent sequences that sat...
We study scalable algorithms for frequent sequence mining under flexible subsequence constraints. Su...
Frequent sequence mining methods often make use of constraints to control which subsequences should ...
This paper proposes a novel algorithm for mining closed frequent sequences, a scalable, c...
Mining frequent sequences in large databases has been an important research topic. The m...
The main advantage of Constraint Programming (CP) approaches for sequential pattern mining (SPM) is ...
Classic support-based approaches efficiently address frequent sequence mining. However, su...
The number of applications generating sequential data is exploding. This work studies the discoverin...
Classic support-based approaches efficiently address frequent sequence mining. However, sup...
Frequent pattern mining has been around ever since the data mining domain became popular. Howev...