Discovering the key structure of a database is one of the main goals of data mining. In pattern set mining we do so by discovering a small set of patterns that together describe the data well. The richer the class of patterns we consider, and the more powerful our description language, the better we will be able to summarise the data. In this paper we propose \ourmethod, a novel greedy MDL-based method for summarising sequential data using rich patterns that are allowed to interleave. Experiments show that \ourmethod is orders of magnitude faster than the state of the art, yields better models, and discovers meaningful semantics in the form of patterns that identify multiple choices of values.
Sequential pattern mining, first proposed by Agrawal and Srikant, has received intensive research due ...
There is a huge wealth of sequence data available, for example, customer purchase histori...
Sequential pattern mining stems from the need to obtain patterns that are repeated in multiple t...
Pattern mining based on data compression has been successfully applied in many data mining tasks. Fo...
The discovery of patterns plays an important role in data mining. A pattern can be any type of regul...
We propose a streaming algorithm, based on the minimum description length (MDL) principle, for extra...
We study how to obtain concise descriptions of discrete multivariate sequential data. In particular,...
This paper addresses the discovery of sequential patterns in very large databases. Most of the exist...
In this paper we present a new algorithm for fast discovery of Sequential Patterns. Given a collecti...
We present an overview of data mining techniques for extracting knowledge from large databases with ...
Pattern mining is one of the best-known concepts in Data Mining. A big problem in pattern mining is ...
The problem of mining sequential patterns was recently introduced in [AS95]. We are given a databa...
Graph pattern mining algorithms ease graph data analysis by extracting recurri...
In order to find patterns in data, it is often necessary to aggregate or summarise data at a higher ...