We present a deterministic parallel algorithm for the k-majority problem, that can be used to find in parallel frequent items, i.e. those whose multiplicity is greater than a given threshold, and is therefore useful to process iceberg queries and in many other different contexts of applied mathematics and information theory. The algorithm can be used both in the online (stream) context and in the offline setting, the difference being that in the former case we are restricted to a single scan of the input elements, so that verifying the frequent items that have been determined is not allowed (e.g. network traffic streams passing through internet routers), while in the latter a parallel scan of the input can be used to determine the actual k-...
Due to the huge increase in the number and dimension of available databases, efficient solutions for...
We present scalable parallel algorithms with sublinear per-processor communication volume and low la...
Data mining is an emerging research area, whose goal is to discover potentially useful information e...
Recently, several algorithms based on the MapReduce framework have been proposed for frequent patter...
International audienceThe problem of closed frequent itemset discovery is a fundamental problem of d...
Given an array A of n elements and a value 2≤k≤n, a frequent item or k-majority element is an elemen...
We present a message-passing based parallel version of the Space Saving algorithm designed to solve...
The frequent items problem is to process a stream of items and find all items occurring more than a ...
We study the problem of finding the k most frequent items in a stream of items for the recently prop...
We present a survey of the most important algorithms that have been pro- posed in the context of the...
We present a message-passing based parallel version of the Space Saving algorithm designed to solve ...
In this paper we present DCI, a new data mining algorithm for frequent set counting. We also discuss...
Apriori Algorithms are used on very large data sets with high dimensionality. Therefore parallel com...
The frequent elements problem involves processing a stream of elements and finding all elements that...
Count queries belong to a class of summary statistics routinely used in basket analysis, inventor...
Due to the huge increase in the number and dimension of available databases, efficient solutions for...
We present scalable parallel algorithms with sublinear per-processor communication volume and low la...
Data mining is an emerging research area, whose goal is to discover potentially useful information e...
Recently, several algorithms based on the MapReduce framework have been proposed for frequent patter...
International audienceThe problem of closed frequent itemset discovery is a fundamental problem of d...
Given an array A of n elements and a value 2≤k≤n, a frequent item or k-majority element is an elemen...
We present a message-passing based parallel version of the Space Saving algorithm designed to solve...
The frequent items problem is to process a stream of items and find all items occurring more than a ...
We study the problem of finding the k most frequent items in a stream of items for the recently prop...
We present a survey of the most important algorithms that have been pro- posed in the context of the...
We present a message-passing based parallel version of the Space Saving algorithm designed to solve ...
In this paper we present DCI, a new data mining algorithm for frequent set counting. We also discuss...
Apriori Algorithms are used on very large data sets with high dimensionality. Therefore parallel com...
The frequent elements problem involves processing a stream of elements and finding all elements that...
Count queries belong to a class of summary statistics routinely used in basket analysis, inventor...
Due to the huge increase in the number and dimension of available databases, efficient solutions for...
We present scalable parallel algorithms with sublinear per-processor communication volume and low la...
Data mining is an emerging research area, whose goal is to discover potentially useful information e...