Motivated by scenarios in network anomaly detection, we consider the problem of detecting persistent items in a data stream, which are items that occur ‘regularly’ in the stream. In contrast with heavy hitters, persistent items do not necessarily contribute significantly to the volume of a stream, and may escape detection by traditional volume‐based anomaly detectors. We first show that any online algorithm that tracks persistent items exactly must necessarily use a large workspace, and is infeasible to run on a traffic monitoring node. In light of this lower bound, we introduce an approximate formulation of the problem and present a small‐space algorithm to approximately track persistent items over a large data stream. We experimented with...
Abstract. Efficient processing of complex streaming data presents multiple chal-lenges, especially w...
[[abstract]]Mining frequent itemsets has been widely studied over the last decade. Past research foc...
Information-theoretic metrics hold great promise for modeling traffic and detecting anomalies if onl...
A persistent item in a stream is one that occurs regularly in the stream without necessarily contrib...
We propose in this paper an on-line algorithm based on Bloom filters for identifying large flows in ...
International audienceWe investigate the problem of estimating on the fly the frequency at which ite...
This paper addresses a major challenge in data mining applications where the full information about ...
© 2019 Milad ChenaghlouData stream clustering and anomaly detection have grown in importance with th...
The problem of detecting frequent items in streaming data is relevant to many different applications...
© 2015 Dr. Mahsa SalehiAnomaly detection in data streams plays a vital role in on-line data mining a...
International audienceContinuous, dynamic and short-term learning is an effective learning strategy ...
As the number of cyber-attacks increases, there has been increasing emphasis on developing complemen...
Abstract Mining frequent itemsets in a datastream proves to be a difficult problem, as itemsets arri...
Dynamic networks, also called network streams, are an im-portant data representation that applies to...
We study the problem of finding frequent itemsets in a continuous stream of transactions. The curren...
Abstract. Efficient processing of complex streaming data presents multiple chal-lenges, especially w...
[[abstract]]Mining frequent itemsets has been widely studied over the last decade. Past research foc...
Information-theoretic metrics hold great promise for modeling traffic and detecting anomalies if onl...
A persistent item in a stream is one that occurs regularly in the stream without necessarily contrib...
We propose in this paper an on-line algorithm based on Bloom filters for identifying large flows in ...
International audienceWe investigate the problem of estimating on the fly the frequency at which ite...
This paper addresses a major challenge in data mining applications where the full information about ...
© 2019 Milad ChenaghlouData stream clustering and anomaly detection have grown in importance with th...
The problem of detecting frequent items in streaming data is relevant to many different applications...
© 2015 Dr. Mahsa SalehiAnomaly detection in data streams plays a vital role in on-line data mining a...
International audienceContinuous, dynamic and short-term learning is an effective learning strategy ...
As the number of cyber-attacks increases, there has been increasing emphasis on developing complemen...
Abstract Mining frequent itemsets in a datastream proves to be a difficult problem, as itemsets arri...
Dynamic networks, also called network streams, are an im-portant data representation that applies to...
We study the problem of finding frequent itemsets in a continuous stream of transactions. The curren...
Abstract. Efficient processing of complex streaming data presents multiple chal-lenges, especially w...
[[abstract]]Mining frequent itemsets has been widely studied over the last decade. Past research foc...
Information-theoretic metrics hold great promise for modeling traffic and detecting anomalies if onl...