Frequency estimation data structures such as the count-min sketch (CMS) have found numerous applications in databases, networking, computational biology and other domains. Many applications that use the count-min sketch process massive and rapidly evolving data sets. For data-intensive applications that aim to keep the overestimate error low, the count-min sketch becomes too large to store in available RAM and may have to migrate to external storage (e.g., SSD.) Due to the random-read/write nature of hash operations of the count-min sketch, simply placing it on SSD stifles the performance of time-critical applications, requiring about 4-6 random reads/writes to SSD per estimate (lookup) and update (insert) operation. In this paper, we expan...
The count-min sketch (CMS) is a randomized data structure that provides estimates of tokens’ freque...
Maintaining frequency counts for data streams has attracted much interest among the research communi...
The editor's version is available for free until November 03, 2022: https://authors.elsevier.com/c/1...
Sketches are probabilistic data structures that can provide approx- imate results within mathematica...
International audienceConservative Count-Min, a stronger version of the popular Count-Min sketch [Co...
The Count-Min sketch is the most popular data structure for flow size estimation, a basic measuremen...
Count-min is a general-purpose data stream summary technique, which can be used to answer multiple t...
The estimation of the frequency of the elements on a set is needed in a wide range of computing appl...
The estimation of the frequency of the elements on a set is needed in a wide range of computing appl...
A flexible conformal inference method is developed to construct confidence intervals for the frequen...
A sketch is a probabilistic data structure that is used to record frequencies of items in a multi-se...
International audienceWe investigate the problem of estimating on the fly the frequency at which ite...
The Count-Min (CM) Sketch is a compact summary data structure capable of representing a high-dimensi...
16 pagesIn this paper, we investigate the problem of estimating the number of times data items that ...
We present FDCMSS, a new sketch-based algorithm for mining frequent items in data streams. The algor...
The count-min sketch (CMS) is a randomized data structure that provides estimates of tokens’ freque...
Maintaining frequency counts for data streams has attracted much interest among the research communi...
The editor's version is available for free until November 03, 2022: https://authors.elsevier.com/c/1...
Sketches are probabilistic data structures that can provide approx- imate results within mathematica...
International audienceConservative Count-Min, a stronger version of the popular Count-Min sketch [Co...
The Count-Min sketch is the most popular data structure for flow size estimation, a basic measuremen...
Count-min is a general-purpose data stream summary technique, which can be used to answer multiple t...
The estimation of the frequency of the elements on a set is needed in a wide range of computing appl...
The estimation of the frequency of the elements on a set is needed in a wide range of computing appl...
A flexible conformal inference method is developed to construct confidence intervals for the frequen...
A sketch is a probabilistic data structure that is used to record frequencies of items in a multi-se...
International audienceWe investigate the problem of estimating on the fly the frequency at which ite...
The Count-Min (CM) Sketch is a compact summary data structure capable of representing a high-dimensi...
16 pagesIn this paper, we investigate the problem of estimating the number of times data items that ...
We present FDCMSS, a new sketch-based algorithm for mining frequent items in data streams. The algor...
The count-min sketch (CMS) is a randomized data structure that provides estimates of tokens’ freque...
Maintaining frequency counts for data streams has attracted much interest among the research communi...
The editor's version is available for free until November 03, 2022: https://authors.elsevier.com/c/1...