In this paper, we propose an algorithm to partition both the search space and the database for the parallel mining of frequent closed itemsets in large databases. The partition-ing of the search space is based on splitting the power set lattice of the total item set to two sub-lattices. Conditional databases are used to partition the large database. The combination of the search space and database partitioning allows parallel processors to mine the frequent closed item-sets independently and thus minimizes the interprocessor communication and synchronization. The partitioning also ensures the load balance among the parallel processors
International audienceFrequent itemset mining presents one of the fundamental building blocks in dat...
Abstract Traditional methods for data mining typically make the assumption that data is centralized ...
Data mining is the exploration and analysis of large quantities of data to discover meaningful patte...
This paper presents a new scalable algorithm for discovering closed frequent itemsets, a lossless an...
Itemset mining is a well-known exploratory technique used to discover interesting correlations hidde...
Frequent Itemsets mining is well explored for various data types, and its computational complexity i...
International audienceFrequent itemset mining (FIM) is one of the fundamental cornerstones in data m...
International audienceWe introduce a transaction database distribution scheme that divides the frequ...
This paper presents a new scalable algorithm for discovering closed frequent itemsets, which are a l...
Frequent itemsets mining is well explored for various data types, and its computational complexity i...
Abstract: The existence of many large transactions distributed databases with high data schemas, the...
Frequent itemset mining is an important building block in many data mining applications like market ...
We introduce a transaction database distribution scheme that divides the frequent item set mining ta...
International audienceThe problem of closed frequent itemset discovery is a fundamental problem of d...
Data mining is proving itself to be a very important fi eld as the data available is increasing expo...
International audienceFrequent itemset mining presents one of the fundamental building blocks in dat...
Abstract Traditional methods for data mining typically make the assumption that data is centralized ...
Data mining is the exploration and analysis of large quantities of data to discover meaningful patte...
This paper presents a new scalable algorithm for discovering closed frequent itemsets, a lossless an...
Itemset mining is a well-known exploratory technique used to discover interesting correlations hidde...
Frequent Itemsets mining is well explored for various data types, and its computational complexity i...
International audienceFrequent itemset mining (FIM) is one of the fundamental cornerstones in data m...
International audienceWe introduce a transaction database distribution scheme that divides the frequ...
This paper presents a new scalable algorithm for discovering closed frequent itemsets, which are a l...
Frequent itemsets mining is well explored for various data types, and its computational complexity i...
Abstract: The existence of many large transactions distributed databases with high data schemas, the...
Frequent itemset mining is an important building block in many data mining applications like market ...
We introduce a transaction database distribution scheme that divides the frequent item set mining ta...
International audienceThe problem of closed frequent itemset discovery is a fundamental problem of d...
Data mining is proving itself to be a very important fi eld as the data available is increasing expo...
International audienceFrequent itemset mining presents one of the fundamental building blocks in dat...
Abstract Traditional methods for data mining typically make the assumption that data is centralized ...
Data mining is the exploration and analysis of large quantities of data to discover meaningful patte...