The tasks of extracting (top-K) Frequent Itemsets (FI’s) and Association Rules (AR’s) are fundamental primitives in data mining and database applications. Exact algorithms for these problems exist and are widely used, but their running time is hindered by the need of scanning the entire dataset, possibly multiple times. High quality approximations of FI’s and AR’s are sufficient for most practical uses, and a number of recent works explored the application of sampling for fast discovery of approximate solutions to the problems. However, these works do not provide satisfactory performance guarantees on the quality of the approximation, due to the difficulty of bounding the probability of under- or over-sampling any one of an unknown number o...
Abstract—Sampling-based methods have previously been pro-posed for the problem of finding interestin...
Association rules may be used to represent regular patterns in databases for the purpose of decision...
. Association rule discovery is one of the prototypical problems in data mining. In this problem, th...
The tasks of extracting (top-K) Frequent Itemsets (FI’s) and Association Rules (AR’s) are fundamenta...
Abstract. We study the use of sampling for efficiently mining the top-K frequent itemsets of cardina...
Analyzing huge datasets becomes prohibitively slow when the dataset does not fit in main memory. App...
We study the use of sampling for efficiently mining the top-K frequent itemsets of cardinality at m...
We present an algorithm to extract an high-quality approximation of the (top-k) Frequent itemsets (F...
Abstract. Sampling a dataset for faster analysis and looking at it as a sample from an unknown distr...
Frequent Itemsets (FIs) mining is a fundamental primitive in knowledge discovery. It requires to ide...
Consistent sampling is a technique for specifying, in small space, a subset S of a potentially large...
Abstract. Consistent sampling is a technique for specifying, in small space, a subset S of a potenti...
Association rule discovery is one of the prototypical problems in data mining. In this problem, the...
Abstract—In classical association rules mining, a minimum support threshold is assumed to be availab...
Abstract—In classical association rules mining, a minimum support threshold is assumed to be availab...
Abstract—Sampling-based methods have previously been pro-posed for the problem of finding interestin...
Association rules may be used to represent regular patterns in databases for the purpose of decision...
. Association rule discovery is one of the prototypical problems in data mining. In this problem, th...
The tasks of extracting (top-K) Frequent Itemsets (FI’s) and Association Rules (AR’s) are fundamenta...
Abstract. We study the use of sampling for efficiently mining the top-K frequent itemsets of cardina...
Analyzing huge datasets becomes prohibitively slow when the dataset does not fit in main memory. App...
We study the use of sampling for efficiently mining the top-K frequent itemsets of cardinality at m...
We present an algorithm to extract an high-quality approximation of the (top-k) Frequent itemsets (F...
Abstract. Sampling a dataset for faster analysis and looking at it as a sample from an unknown distr...
Frequent Itemsets (FIs) mining is a fundamental primitive in knowledge discovery. It requires to ide...
Consistent sampling is a technique for specifying, in small space, a subset S of a potentially large...
Abstract. Consistent sampling is a technique for specifying, in small space, a subset S of a potenti...
Association rule discovery is one of the prototypical problems in data mining. In this problem, the...
Abstract—In classical association rules mining, a minimum support threshold is assumed to be availab...
Abstract—In classical association rules mining, a minimum support threshold is assumed to be availab...
Abstract—Sampling-based methods have previously been pro-posed for the problem of finding interestin...
Association rules may be used to represent regular patterns in databases for the purpose of decision...
. Association rule discovery is one of the prototypical problems in data mining. In this problem, th...