A probability distribution over an ordered universe [n] = {1,..., n} is said to be a k-histogram if it can be represented as a piecewise-constant function over at most k contiguous intervals. We study the following question: given samples from an arbitrary distribution D over [n], one must decide whether D is a k-histogram, or is far in `1 distance from any such succinct representation. We obtain a sample and time-efficient algorithm for this problem, complemented by a nearly-matching information-theoretic lower bound on the number of samples required for this task. Our results significantly improve on the previous state-of-the-art, due to Indyk, Levi, and Rubinfeld [ILR12] and Canonne, Diakonikolas, Gouleakis, and Rubinfeld [CDGR15]
In this work, we consider the problems of testing whether a distribution over {0, 1} n is k-wise (re...
We consider the problem of testing a basic property of collections of distributions: having similar ...
International audienceWhat advantage do sequential procedures provide over batch algorithms for test...
A discrete distribution p, over [n], is a k histogram if its probability distribution function can b...
Histograms are among the most popular structures for the succinct summarization of data in a variety...
We investigate the problem of testing the equivalence between two discrete histograms. A k-histogram...
A natural way to estimate the probability density function of an unknown distribution from the sampl...
Histograms are convenient non-parametric density estimators, which continue to be used ubiquitously....
Let p be an unknown and arbitrary probability distribution over [0, 1). We con-sider the problem of ...
Let p be an unknown and arbitrary probability distribution over [0, 1). We con-sider the problem of ...
We demonstrate that gaps and distributional patterns embedded within real-valued measurements are in...
We demonstrate that gaps and distributional patterns embedded within real-valued measurements are in...
We demonstrate that gaps and distributional patterns embedded within real-valued measurements are in...
We demonstrate that gaps and distributional patterns embedded within real-valued measurements are in...
We demonstrate that gaps and distributional patterns embedded within real-valued measurements are in...
In this work, we consider the problems of testing whether a distribution over {0, 1} n is k-wise (re...
We consider the problem of testing a basic property of collections of distributions: having similar ...
International audienceWhat advantage do sequential procedures provide over batch algorithms for test...
A discrete distribution p, over [n], is a k histogram if its probability distribution function can b...
Histograms are among the most popular structures for the succinct summarization of data in a variety...
We investigate the problem of testing the equivalence between two discrete histograms. A k-histogram...
A natural way to estimate the probability density function of an unknown distribution from the sampl...
Histograms are convenient non-parametric density estimators, which continue to be used ubiquitously....
Let p be an unknown and arbitrary probability distribution over [0, 1). We con-sider the problem of ...
Let p be an unknown and arbitrary probability distribution over [0, 1). We con-sider the problem of ...
We demonstrate that gaps and distributional patterns embedded within real-valued measurements are in...
We demonstrate that gaps and distributional patterns embedded within real-valued measurements are in...
We demonstrate that gaps and distributional patterns embedded within real-valued measurements are in...
We demonstrate that gaps and distributional patterns embedded within real-valued measurements are in...
We demonstrate that gaps and distributional patterns embedded within real-valued measurements are in...
In this work, we consider the problems of testing whether a distribution over {0, 1} n is k-wise (re...
We consider the problem of testing a basic property of collections of distributions: having similar ...
International audienceWhat advantage do sequential procedures provide over batch algorithms for test...