International audiencePattern Mining (PM) has a prominent place in Data Science and finds its application in a wide range of domains. To avoid the exponential explosion of patterns different methods have been proposed. They are based on assumptions on interestingness and usually return very different pattern sets. In this paper we propose to use a compression-based objective as a well-justified and robust interestingness measure. We define the description lengths for datasets and use the Minimum Description Length principle (MDL) to find patterns that ensure the best compression. Our experiments show that the application of MDL to numerical data provides a small and characteristic subsets of patterns describing data in a compact way
Compression based pattern mining has been successfully applied to many data mining tasks. We propose...
In this paper we study the extraction of closed patterns associated to their generators in numerical...
Pattern Mining is one of the most researched topics in the data mining community. Literally hundreds...
International audiencePattern Mining (PM) has a prominent place in Data Science and finds its applic...
This is about the Minimum Description Length (MDL) principle applied to pattern mining. The length o...
International audienceIn this paper, we investigate the problem of mining numerical data in the fram...
The discovery of patterns plays an important role in data mining. A pattern can be any type of regul...
Pattern mining is one of the best-known concepts in Data Mining. A big problem in pattern mining is ...
International audienceAbstract Pattern mining is well established in data mining research, especiall...
Pattern mining is well established in data mining research, especiallyfor mining binary datasets. Su...
Pattern mining based on data compression has been successfully applied in many data mining tasks. Fo...
International audienceThe Minimal Description Length (MDL) principle is a powerful and well founded ...
In this thesis, we study different aspects of pattern mining in binary and numerical tabular dataset...
When considering a data set it is often unknown how complex it is, and hence it is difficult to asse...
International audiencePattern Mining is a well-studied field in Data Mining and Machine Learning. Th...
Compression based pattern mining has been successfully applied to many data mining tasks. We propose...
In this paper we study the extraction of closed patterns associated to their generators in numerical...
Pattern Mining is one of the most researched topics in the data mining community. Literally hundreds...
International audiencePattern Mining (PM) has a prominent place in Data Science and finds its applic...
This is about the Minimum Description Length (MDL) principle applied to pattern mining. The length o...
International audienceIn this paper, we investigate the problem of mining numerical data in the fram...
The discovery of patterns plays an important role in data mining. A pattern can be any type of regul...
Pattern mining is one of the best-known concepts in Data Mining. A big problem in pattern mining is ...
International audienceAbstract Pattern mining is well established in data mining research, especiall...
Pattern mining is well established in data mining research, especiallyfor mining binary datasets. Su...
Pattern mining based on data compression has been successfully applied in many data mining tasks. Fo...
International audienceThe Minimal Description Length (MDL) principle is a powerful and well founded ...
In this thesis, we study different aspects of pattern mining in binary and numerical tabular dataset...
When considering a data set it is often unknown how complex it is, and hence it is difficult to asse...
International audiencePattern Mining is a well-studied field in Data Mining and Machine Learning. Th...
Compression based pattern mining has been successfully applied to many data mining tasks. We propose...
In this paper we study the extraction of closed patterns associated to their generators in numerical...
Pattern Mining is one of the most researched topics in the data mining community. Literally hundreds...