Data streams are usually generated online and are characterized by huge volume, rapid and unpredictable arrival rates, and fast-changing data characteristics. It has hence been recognized that mining over streaming data requires the problem of limited computational resources to be adequately addressed. Because the arrival rate of a data stream can increase significantly and exceed the CPU capacity, the mining machinery must adapt to this change to guarantee the timeliness of the results. We present an online algorithm that approximates the set of frequent patterns in a sliding window over the underlying data stream, given an a priori CPU capacity. The algorithm automatically detects overload situations and can adaptively shed unprocessed data to guarantee the ti...
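The idea of shedding load while mining a sliding window can be illustrated with a minimal sketch. This is not the algorithm of the paper: it counts single items rather than general patterns, uses uniform random sampling as the shedding policy, and treats a fixed per-batch transaction budget as a stand-in for CPU capacity. The class name `SheddingWindowMiner` and the parameters `window_batches`, `capacity`, and `min_support` are hypothetical names chosen for illustration.

```python
import random
from collections import Counter, deque


class SheddingWindowMiner:
    """Approximate frequent items over a sliding window of batches.

    Assumption: `capacity` (transactions processed per batch) stands in
    for the CPU budget; the shedding policy is plain uniform sampling.
    """

    def __init__(self, window_batches, capacity, seed=0):
        self.capacity = capacity
        # The deque drops the oldest batch automatically once it is full.
        self.window = deque(maxlen=window_batches)
        self.rng = random.Random(seed)

    def add_batch(self, transactions):
        """Process one batch; return how many transactions were shed."""
        n = len(transactions)
        if n > self.capacity:
            # Overload detected: keep a uniform sample of the batch.
            kept = self.rng.sample(transactions, self.capacity)
        else:
            kept = transactions
        counts = Counter()
        for t in kept:
            counts.update(set(t))  # count each item once per transaction
        # Scale factor that compensates for the shed transactions.
        scale = n / len(kept) if kept else 1.0
        self.window.append((counts, scale, n))
        return n - len(kept)

    def frequent(self, min_support):
        """Return items whose estimated support is at least min_support."""
        total = sum(n for _, _, n in self.window)
        if total == 0:
            return {}
        est = Counter()
        for counts, scale, _ in self.window:
            for item, c in counts.items():
                est[item] += c * scale
        return {i: v / total for i, v in est.items() if v / total >= min_support}
```

A batch that fits within the budget is counted exactly, while an oversized batch is sampled and its counts are scaled up, so the reported supports become estimates rather than exact values. Calling `add_batch` once per time unit and `frequent(min_support)` on demand is one way to use the sketch.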
We investigate the problem of estimating on the fly the frequency at which items recur in large scal...
Mining frequent patterns on streaming data is a new challenging problem for th...
In data stream applications, a good approximation obtained in a timely manner is often better ...
For most data stream applications, the volume of data is too huge to be stored in permanent devices ...
Most algorithms that focus on discovering frequent patterns from data streams assumed that the machi...
Traditional data mining techniques expect all data to be managed within some form of persistent data...
Many critical applications, like intrusion detection or stock market analysis, require a nearly imme...
This paper introduces a new algorithm for approximate mining of frequent patterns from streams of tr...
Discovering frequent patterns over event sequences is an important data mining problem. Ex...
Traditional algorithms for frequent itemset discovery are designed for static data. They cannot be s...
We consider a CPU constrained environment for finding approximation of frequent sets in data streams...
Recently, the data stream, which is an unbounded sequence of data elements generated at a ...
Mining frequent itemsets has been widely studied over the last decade. Past research foc...
In this paper, methods are investigated for online, frequent pattern mining of stream data, with ...
Li GH, Chen H. Mining the frequent patterns in an arbitrary sliding window over online data streams