We present a refined analysis of the classic Count-Sketch streaming heavy hitters algo-rithm [CCF02]. Count-Sketch uses O(k log n) linear measurements of a vector x ∈ Rn to give an estimate x ̂ of x. The standard analysis shows that this estimate x ̂ satisfies ‖x̂−x‖2 ∞ < ‖x[k]‖22/k, where x [k] is the vector containing all but the largest k coordinates of x. Our main result is that most of the coordinates of x ̂ have substantially less error than this upper bound; namely, for any c < O(log n), we show that each coordinate i satisfies (x̂i − xi)2 < c log
We consider the problem of approximate range counting over streams of d-dimensional points. In the d...
We give the first L1-sketching algorithm for integer vectors which produces nearly optimal sized ske...
The task of finding heavy hitters is one of the best known and well studied problems in the area of ...
We present a refined analysis of the classic Count-Sketch streaming heavy hitters algo-rithm [CCF02]...
The editor's version is available for free until November 03, 2022: https://authors.elsevier.com/c/1...
International audienceCount-Min Sketch with Conservative Updates (CMS-CU) is a popular algorithm to ...
The Count-Min (CM) Sketch is a compact summary data structure capable of representing a high-dimensi...
In the recent years linear sketching has emerged as a powerful tool for approximate computing in set...
Streaming algorithms, which process very large datasets received one update at a time, are a key too...
Abstract — We introduce a new method of data collection for flow size estimation, the optimized flow...
Count-min is a general-purpose data stream summary technique, which can be used to answer multiple t...
Many NLP tasks rely on accurate statis-tics from large corpora. Tracking com-plete statistics is mem...
The count-min sketch is a useful data structure for recording and estimating the frequency of string...
Real-time data stream processing is key to many Internet applications ranging from e-commerce, socia...
Unlike estimating the norm of a vector in a stream, the memory required for estimating the norm of a...
We consider the problem of approximate range counting over streams of d-dimensional points. In the d...
We give the first L1-sketching algorithm for integer vectors which produces nearly optimal sized ske...
The task of finding heavy hitters is one of the best known and well studied problems in the area of ...
We present a refined analysis of the classic Count-Sketch streaming heavy hitters algo-rithm [CCF02]...
The editor's version is available for free until November 03, 2022: https://authors.elsevier.com/c/1...
International audienceCount-Min Sketch with Conservative Updates (CMS-CU) is a popular algorithm to ...
The Count-Min (CM) Sketch is a compact summary data structure capable of representing a high-dimensi...
In the recent years linear sketching has emerged as a powerful tool for approximate computing in set...
Streaming algorithms, which process very large datasets received one update at a time, are a key too...
Abstract — We introduce a new method of data collection for flow size estimation, the optimized flow...
Count-min is a general-purpose data stream summary technique, which can be used to answer multiple t...
Many NLP tasks rely on accurate statis-tics from large corpora. Tracking com-plete statistics is mem...
The count-min sketch is a useful data structure for recording and estimating the frequency of string...
Real-time data stream processing is key to many Internet applications ranging from e-commerce, socia...
Unlike estimating the norm of a vector in a stream, the memory required for estimating the norm of a...
We consider the problem of approximate range counting over streams of d-dimensional points. In the d...
We give the first L1-sketching algorithm for integer vectors which produces nearly optimal sized ske...
The task of finding heavy hitters is one of the best known and well studied problems in the area of ...