International audienceWe introduce TopPI, a new semantics and algorithm designed to mine long-tailed datasets. For each item, and regardless of its frequency, TopPI finds the k most frequent closed itemsets that item belongs to. For example, in our retail dataset, TopPI finds the itemset " nori seaweed, wasabi, sushi rice, soy sauce " that occurrs in only 133 store receipts out of 290 million. It also finds the itemset " milk, puff pastry " , that appears 152,991 times. Thanks to a dynamic threshold adjustment and an adequate pruning strategy, TopPI efficiently traverses the relevant parts of the search space and can be parallelized on multi-cores. Our experiments on datasets with different characteristics show the high performance of TopPI...
Frequent closed itemset mining is among the most complex exploratory techniques in data mining, and ...
This paper presents a new scalable algorithm for discovering closed frequent itemsets, which are a l...
Frequent itemset mining is today one of the most popular data mining techniques. Its application is,...
International audienceWe introduce TopPI, a new semantics and algorithm designed to mine long-tailed...
International audienceIn this paper, we introduce item-centric mining, a new semantics for mining lo...
The recent increase of data volumes raises new challenges for itemset mining algorithms. In this the...
Association rules show strong relationship between attribute-value pairs (or items) that occur frequ...
Frequent itemset mining is an exploratory data mining technique that has fruitfully been exploited t...
This paper presents a new scalable algorithm for discovering closed frequent itemsets, a lossless an...
In today’s world, large volumes of data are being continuously generated by many scientific applicat...
Les algorithmes actuels pour la fouille d’ensembles fréquents sont dépassés par l’augmentation des v...
In this work we study the mining of top-$K$ frequent closed itemsets, a recently proposed variant o...
International audienceAssociation rule discovery based on support-confidence frame-work is an import...
International audienceMining big datasets poses a number of challenges which are not easily addresse...
Frequent closed itemset mining is among the most complex exploratory techniques in data mining, and ...
This paper presents a new scalable algorithm for discovering closed frequent itemsets, which are a l...
Frequent itemset mining is today one of the most popular data mining techniques. Its application is,...
International audienceWe introduce TopPI, a new semantics and algorithm designed to mine long-tailed...
International audienceIn this paper, we introduce item-centric mining, a new semantics for mining lo...
The recent increase of data volumes raises new challenges for itemset mining algorithms. In this the...
Association rules show strong relationship between attribute-value pairs (or items) that occur frequ...
Frequent itemset mining is an exploratory data mining technique that has fruitfully been exploited t...
This paper presents a new scalable algorithm for discovering closed frequent itemsets, a lossless an...
In today’s world, large volumes of data are being continuously generated by many scientific applicat...
Les algorithmes actuels pour la fouille d’ensembles fréquents sont dépassés par l’augmentation des v...
In this work we study the mining of top-$K$ frequent closed itemsets, a recently proposed variant o...
International audienceAssociation rule discovery based on support-confidence frame-work is an import...
International audienceMining big datasets poses a number of challenges which are not easily addresse...
Frequent closed itemset mining is among the most complex exploratory techniques in data mining, and ...
This paper presents a new scalable algorithm for discovering closed frequent itemsets, which are a l...
Frequent itemset mining is today one of the most popular data mining techniques. Its application is,...