International audienceFrequent itemset mining (FIM) is one of the fundamental cornerstones in data mining. While, the problem of FIM has been thoroughly studied, few of both standard and improved solutions scale. This is mainly the case when i) the amount of data tends to be very large and/or ii) the minimum support (M inSup) threshold is very low. In this paper, we propose a highly scalable, parallel frequent itemset mining (PFIM) algorithm, namely Parallel Absolute Top Down (PATD). PATD algorithm renders the mining process of very large databases (up to Ter-abytes of data) simple and compact. Its mining process is made up of only one parallel job, which dramatically reduces the mining runtime, the communication cost and the energy power c...
Itemset mining is a well-known exploratory data mining technique used to discover interesting correl...
An efficient parallel algorithm FPM(Fast Parallel Mining) for mining association rules on a shared-n...
Data mining is an emerging research area, whose goal is to discover potentially useful information e...
International audienceFrequent itemset mining presents one of the fundamental building blocks in dat...
International audienceDespite crucial recent advances, the problem of frequent itemset mining is sti...
Frequent Itemsets Mining (FIM) is a fundamental mining model and plays an important role in Data Min...
Itemset mining is a well-known exploratory technique used to discover interesting correlations hidde...
In recent years, knowledge discovery in databases provides a powerful capability to discover meaning...
International audienceData analytics in general, and data mining primitives in particular , are a ma...
Recently, several algorithms based on the MapReduce framework have been proposed for frequent patter...
In this paper, we propose an algorithm to partition both the search space and the database for the p...
Frequent itemset mining is an important building block in many data mining applications like market ...
Abstract Traditional methods for data mining typically make the assumption that data is centralized ...
Traditional methods for frequent itemset mining typically assume that data is centralized and static...
Part 4: Session 4: Multi-core Computing and GPUInternational audienceFrequent Itemset Mining (FIM) i...
Itemset mining is a well-known exploratory data mining technique used to discover interesting correl...
An efficient parallel algorithm FPM(Fast Parallel Mining) for mining association rules on a shared-n...
Data mining is an emerging research area, whose goal is to discover potentially useful information e...
International audienceFrequent itemset mining presents one of the fundamental building blocks in dat...
International audienceDespite crucial recent advances, the problem of frequent itemset mining is sti...
Frequent Itemsets Mining (FIM) is a fundamental mining model and plays an important role in Data Min...
Itemset mining is a well-known exploratory technique used to discover interesting correlations hidde...
In recent years, knowledge discovery in databases provides a powerful capability to discover meaning...
International audienceData analytics in general, and data mining primitives in particular , are a ma...
Recently, several algorithms based on the MapReduce framework have been proposed for frequent patter...
In this paper, we propose an algorithm to partition both the search space and the database for the p...
Frequent itemset mining is an important building block in many data mining applications like market ...
Abstract Traditional methods for data mining typically make the assumption that data is centralized ...
Traditional methods for frequent itemset mining typically assume that data is centralized and static...
Part 4: Session 4: Multi-core Computing and GPUInternational audienceFrequent Itemset Mining (FIM) i...
Itemset mining is a well-known exploratory data mining technique used to discover interesting correl...
An efficient parallel algorithm FPM(Fast Parallel Mining) for mining association rules on a shared-n...
Data mining is an emerging research area, whose goal is to discover potentially useful information e...