Approximating ranks, quantiles, and distributions over streaming data is a central task in data analysis and monitoring. Given a stream of $n$ items from a data universe $\mathcal{U}$ equipped with a total order, the task is to compute a sketch (data structure) of size poly$(\log(n), 1/\varepsilon)$. Given the sketch and a query item $y \in \mathcal{U}$, one should be able to approximate its rank in the stream, i.e., the number of stream elements smaller than or equal to $y$. Most works to date focused on additive $\varepsilon n$ error approximation, culminating in the KLL sketch that achieved optimal asymptotic behavior. This paper investigates multiplicative $(1\pm\varepsilon)$-error approximations to the rank. Practical motivation for ...
This LNCS vol. is the Proceedings of FAW 2010This paper studies the space complexity of the ε-approx...
As the size of data available for processing increases, new models of computation are needed. This ...
This paper considers the problem of maintaining statistic aggregates over the last W elements of a d...
Approximating ranks, quantiles, and distributions over streaming data is a central task in data anal...
When trying to process a data stream in small space, how important is the order in which the data ar...
Streaming algorithms, which process very large datasets received one update at a time, are a key too...
We present lower bounds on the space required to estimate the quantiles of a stream of numerical val...
When trying to process a data stream in small space, how important is the order in which the data ar...
A fundamental problem in data management and analysis is to generate descriptions of the distributio...
High-volume data streams are too large and grow too quickly to store entirely in working memory, int...
Skew is prevalentin many data sourcessuchas IP traffic streams. To continually summarize the distrib...
Exact solutions are unattainable for important problems. The calculations are limited by the memory ...
We revisit one of the classic problems in the data stream literature, namely, that of estimating the...
We consider computing a longest palindrome in the streaming model, where the symbols arrive one-by-o...
We present UDDSketch (Uniform DDSketch), a novel sketch for fast and accurate tracking of quantiles ...
This LNCS vol. is the Proceedings of FAW 2010This paper studies the space complexity of the ε-approx...
As the size of data available for processing increases, new models of computation are needed. This ...
This paper considers the problem of maintaining statistic aggregates over the last W elements of a d...
Approximating ranks, quantiles, and distributions over streaming data is a central task in data anal...
When trying to process a data stream in small space, how important is the order in which the data ar...
Streaming algorithms, which process very large datasets received one update at a time, are a key too...
We present lower bounds on the space required to estimate the quantiles of a stream of numerical val...
When trying to process a data stream in small space, how important is the order in which the data ar...
A fundamental problem in data management and analysis is to generate descriptions of the distributio...
High-volume data streams are too large and grow too quickly to store entirely in working memory, int...
Skew is prevalentin many data sourcessuchas IP traffic streams. To continually summarize the distrib...
Exact solutions are unattainable for important problems. The calculations are limited by the memory ...
We revisit one of the classic problems in the data stream literature, namely, that of estimating the...
We consider computing a longest palindrome in the streaming model, where the symbols arrive one-by-o...
We present UDDSketch (Uniform DDSketch), a novel sketch for fast and accurate tracking of quantiles ...
This LNCS vol. is the Proceedings of FAW 2010This paper studies the space complexity of the ε-approx...
As the size of data available for processing increases, new models of computation are needed. This ...
This paper considers the problem of maintaining statistic aggregates over the last W elements of a d...