We present FDCMSS, a new sketch-based algorithm for mining frequent items in data streams. The algorithm cleverly combines key ideas borrowed from forward decay, the Count-Min and the Space Saving algorithms. It works in the time fading model, mining data streams according to the cash register model. We formally prove its correctness and show, through extensive experimental results, that our algorithm outperforms λ-HCount, a recently developed algorithm, with regard to speed, space used, precision attained and error committed on both synthetic and real datasets
Mining frequent itemsets from transactional data streams is challenging due to the nature of the exp...
International audienceWe investigate the problem of estimating on the fly the frequency at which ite...
Abstract Mining frequent itemsets over a stream of transactions presents di cult new challenges over...
We deal with the problem of detecting frequent items in a stream under the constraint that items are...
Abstract Mining frequent itemsets in a datastream proves to be a difficult problem, as itemsets arri...
The frequent items problem is to process a stream of items and find all items occurring more than a ...
In this paper we present PFDCMSS (Parallel Forward Decay Count-Min Space Saving) which, to the best ...
The problem of detecting frequent items in streaming data is relevant to many different applications...
The increasing prominence of data streams arising in a wide range of advanced applications such as f...
Mining frequent itemsets in a datastream proves to be a difficult problem, as itemsets arrive in rap...
The increasing prominence of data streams arising in a wide range of advanced applications such as f...
[[abstract]]Mining frequent itemsets has been widely studied over the last decade. Past research foc...
We investigate the problem of frequent itemset mining over a data stream with bursty traffic. In man...
The increasing importance of data stream arising in a wide range of advanced applications has led to...
AbstractWe present a 1-pass algorithm for estimating the most frequent items in a data stream using ...
Mining frequent itemsets from transactional data streams is challenging due to the nature of the exp...
International audienceWe investigate the problem of estimating on the fly the frequency at which ite...
Abstract Mining frequent itemsets over a stream of transactions presents di cult new challenges over...
We deal with the problem of detecting frequent items in a stream under the constraint that items are...
Abstract Mining frequent itemsets in a datastream proves to be a difficult problem, as itemsets arri...
The frequent items problem is to process a stream of items and find all items occurring more than a ...
In this paper we present PFDCMSS (Parallel Forward Decay Count-Min Space Saving) which, to the best ...
The problem of detecting frequent items in streaming data is relevant to many different applications...
The increasing prominence of data streams arising in a wide range of advanced applications such as f...
Mining frequent itemsets in a datastream proves to be a difficult problem, as itemsets arrive in rap...
The increasing prominence of data streams arising in a wide range of advanced applications such as f...
[[abstract]]Mining frequent itemsets has been widely studied over the last decade. Past research foc...
We investigate the problem of frequent itemset mining over a data stream with bursty traffic. In man...
The increasing importance of data stream arising in a wide range of advanced applications has led to...
AbstractWe present a 1-pass algorithm for estimating the most frequent items in a data stream using ...
Mining frequent itemsets from transactional data streams is challenging due to the nature of the exp...
International audienceWe investigate the problem of estimating on the fly the frequency at which ite...
Abstract Mining frequent itemsets over a stream of transactions presents di cult new challenges over...