Abstract—We present a new approach related to the discovery of correlated patterns based on the use of multicore architectures. Our work rests on a full Knowledge Discovery in Databases system allowing one to extract Decision Correlation Rules based on the Chi-squared criterion from any database that includes a target column. We use a levelwise algorithm as well as contingency vectors, an alternate and more powerful representation of contin-gency tables. The goal is to parallelize the extraction of relevant rules by invoking the Parallel Patterns Library which allows a simultaneous access to the whole available cores on modern computers. We finally present first results and performance gains
Thesis (Ph. D.)--University of Rochester. Dept. of Computer Science, 1998. Simultaneously published...
Data mining is an emerging research area, whose goal is to extract significant patterns or interesti...
We present parallel algorithms for mining Correlated Heavy Hitters from a two-dimensional data strea...
Part 1: Cross-Domain Conference and Workshop on Multidisciplinary Research and Practice for Informat...
6 pagesInternational audienceIn this paper, we present a new approach relevant to the discovery of c...
16 pagesInternational audienceIn this paper, we present a new approach relevant to the discovery of ...
In this paper, two concepts are introduced: decision correlation rules and contingency vectors. The ...
International audienceGradual patterns highlight complex order correlations of the form "The more/le...
We study mining correlations from quantitative databases and show that this is a more effective appr...
One of the important problems in data mining is discovering association rules from databases of tran...
The problem of mining hidden associations present in the large amounts of data has seen widespread a...
The amount of data produced by ubiquitous computing applications is quickly growing, due to the perv...
Abstract. To mine databases in which examples are tagged with class labels, the minimum correlation ...
Pattern mining has been a hot issue since it was first proposed for market basket analysis. Even tho...
Data mining is an emerging research area, whose goal is to extract significant patterns or interesti...
Thesis (Ph. D.)--University of Rochester. Dept. of Computer Science, 1998. Simultaneously published...
Data mining is an emerging research area, whose goal is to extract significant patterns or interesti...
We present parallel algorithms for mining Correlated Heavy Hitters from a two-dimensional data strea...
Part 1: Cross-Domain Conference and Workshop on Multidisciplinary Research and Practice for Informat...
6 pagesInternational audienceIn this paper, we present a new approach relevant to the discovery of c...
16 pagesInternational audienceIn this paper, we present a new approach relevant to the discovery of ...
In this paper, two concepts are introduced: decision correlation rules and contingency vectors. The ...
International audienceGradual patterns highlight complex order correlations of the form "The more/le...
We study mining correlations from quantitative databases and show that this is a more effective appr...
One of the important problems in data mining is discovering association rules from databases of tran...
The problem of mining hidden associations present in the large amounts of data has seen widespread a...
The amount of data produced by ubiquitous computing applications is quickly growing, due to the perv...
Abstract. To mine databases in which examples are tagged with class labels, the minimum correlation ...
Pattern mining has been a hot issue since it was first proposed for market basket analysis. Even tho...
Data mining is an emerging research area, whose goal is to extract significant patterns or interesti...
Thesis (Ph. D.)--University of Rochester. Dept. of Computer Science, 1998. Simultaneously published...
Data mining is an emerging research area, whose goal is to extract significant patterns or interesti...
We present parallel algorithms for mining Correlated Heavy Hitters from a two-dimensional data strea...