The tasks of extracting (top-K) Frequent Itemsets (FI’s) and Association Rules (AR’s) are fundamental primitives in data mining and database applications. Exact algorithms for these problems exist and are widely used, but their running time is hindered by the need of scanning the entire dataset, possibly mul-tiple times. High quality approximations of FI’s and AR’s are sufficient for most practical uses. Sampling techniques can be used for fast discovery of approximate solutions, but works exploring this technique did not provide satisfactory performance guarantees on the quality of the approximation, due to the difficulty of bounding the probability of under- or over-sampling any one of an unknown number of frequent itemsets. We circumvent...
Abstract—In classical association rules mining, a minimum support threshold is assumed to be availab...
Abstract—In classical association rules mining, a minimum support threshold is assumed to be availab...
Abstract. Consistent sampling is a technique for specifying, in small space, a subset S of a potenti...
The tasks of extracting (top-K) Frequent Itemsets (FI’s) and Association Rules (AR’s) are fundamenta...
Abstract. We study the use of sampling for efficiently mining the top-K frequent itemsets of cardina...
Analyzing huge datasets becomes prohibitively slow when the dataset does not fit in main memory. App...
We study the use of sampling for efficiently mining the top-K frequent itemsets of cardinality at m...
We present an algorithm to extract an high-quality approximation of the (top-k) Frequent itemsets (F...
Association rules may be used to represent regular patterns in databases for the purpose of decision...
Frequent Itemsets (FIs) mining is a fundamental primitive in knowledge discovery. It requires to ide...
Abstract. Sampling a dataset for faster analysis and looking at it as a sample from an unknown distr...
Association rule discovery has emerged as an important problem in knowledge discovery and data minin...
. Association rule discovery is one of the prototypical problems in data mining. In this problem, th...
Association rule discovery is one of the prototypical problems in data mining. In this problem, the...
Consistent sampling is a technique for specifying, in small space, a subset S of a potentially large...
Abstract—In classical association rules mining, a minimum support threshold is assumed to be availab...
Abstract—In classical association rules mining, a minimum support threshold is assumed to be availab...
Abstract. Consistent sampling is a technique for specifying, in small space, a subset S of a potenti...
The tasks of extracting (top-K) Frequent Itemsets (FI’s) and Association Rules (AR’s) are fundamenta...
Abstract. We study the use of sampling for efficiently mining the top-K frequent itemsets of cardina...
Analyzing huge datasets becomes prohibitively slow when the dataset does not fit in main memory. App...
We study the use of sampling for efficiently mining the top-K frequent itemsets of cardinality at m...
We present an algorithm to extract an high-quality approximation of the (top-k) Frequent itemsets (F...
Association rules may be used to represent regular patterns in databases for the purpose of decision...
Frequent Itemsets (FIs) mining is a fundamental primitive in knowledge discovery. It requires to ide...
Abstract. Sampling a dataset for faster analysis and looking at it as a sample from an unknown distr...
Association rule discovery has emerged as an important problem in knowledge discovery and data minin...
. Association rule discovery is one of the prototypical problems in data mining. In this problem, th...
Association rule discovery is one of the prototypical problems in data mining. In this problem, the...
Consistent sampling is a technique for specifying, in small space, a subset S of a potentially large...
Abstract—In classical association rules mining, a minimum support threshold is assumed to be availab...
Abstract—In classical association rules mining, a minimum support threshold is assumed to be availab...
Abstract. Consistent sampling is a technique for specifying, in small space, a subset S of a potenti...