An efficient parallel algorithm FPM(Fast Parallel Mining) for mining association rules on a shared-nothing parallel system has been proposed. It adopts the count distribution approach and has incorporated two powerful candidate pruning techniques, i.e., distributed pruning and global pruning. It has a simple communication scheme which performs only one round of message exchange in each iteration. We found that the two pruning techniques are very sensitive to data skewness, which describes the degree of non-uniformity of the itemset distribution among the database partitions. Distributed pruning is very effective when data skewness is high. Global pruning is more effective than distributed pruning even for the mild data skewness case. We hav...
The problem of mining hidden associations present in the large amounts of data has seen widespread a...
Frequent Itemsets and Association Rules Mining (FIM) is a key task in knowledge discovery from data....
The discovery of interesting patterns from database transactions is one of the major problems in kno...
Association rule mining is an important new problem in data mining. It has crucial applications in d...
Abstract: The existence of many large transactions distributed databases with high data schemas, the...
In this paper we propose two new parallel formulations of the Apriori algorithm that is used for com...
Data mining is an emerging research area, whose goal is to extract significant patterns or interesti...
Mining association rules from large databases is an important problem in data mining. There is a nee...
Data mining is an emerging research area, whose goal is to extract significant patterns or interesti...
All current parallel algorithms for mining the association rules follow the conventional level-wise ...
Mining association rules from large databases is very costly. We propose to develop parallel algorit...
Association rule discovery techniques have gradually been adapted to parallel systems in order to t...
With the existence of many large transaction databases, the huge amounts of data, the high scalabili...
Association rule mining recently attracted strong attention. Usually, the classification hierarchy o...
In this paper we consider the problem of parallel mining of association rules on a shared-memory mul...
The problem of mining hidden associations present in the large amounts of data has seen widespread a...
Frequent Itemsets and Association Rules Mining (FIM) is a key task in knowledge discovery from data....
The discovery of interesting patterns from database transactions is one of the major problems in kno...
Association rule mining is an important new problem in data mining. It has crucial applications in d...
Abstract: The existence of many large transactions distributed databases with high data schemas, the...
In this paper we propose two new parallel formulations of the Apriori algorithm that is used for com...
Data mining is an emerging research area, whose goal is to extract significant patterns or interesti...
Mining association rules from large databases is an important problem in data mining. There is a nee...
Data mining is an emerging research area, whose goal is to extract significant patterns or interesti...
All current parallel algorithms for mining the association rules follow the conventional level-wise ...
Mining association rules from large databases is very costly. We propose to develop parallel algorit...
Association rule discovery techniques have gradually been adapted to parallel systems in order to t...
With the existence of many large transaction databases, the huge amounts of data, the high scalabili...
Association rule mining recently attracted strong attention. Usually, the classification hierarchy o...
In this paper we consider the problem of parallel mining of association rules on a shared-memory mul...
The problem of mining hidden associations present in the large amounts of data has seen widespread a...
Frequent Itemsets and Association Rules Mining (FIM) is a key task in knowledge discovery from data....
The discovery of interesting patterns from database transactions is one of the major problems in kno...