We investigate the problem of testing the equivalence between two discrete histograms. A k-histogram over [n] is a probability distribution that is piecewise constant over some set of k intervals over [n]. Histograms have been extensively studied in computer science and statistics. Given a set of samples from two k-histogram distributions p, q over [n], we want to distinguish (with high probability) between the cases that p = q and ||p ? q||_1 >= epsilon. The main contribution of this paper is a new algorithm for this testing problem and a nearly matching information-theoretic lower bound. Specifically, the sample complexity of our algorithm matches our lower bound up to a logarithmic factor, improving on previous work by polynomial factor...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
We study the question of testing structured properties (classes) of discrete distributions. Specific...
We study the question of testing structured properties (classes) of discrete distributions. Specific...
A discrete distribution p, over [n], is a k histogram if its probability distribution function can b...
We study the question of closeness testing for two discrete distributions. More precisely, given sam...
A probability distribution over an ordered universe [n] = {1,..., n} is said to be a k-histogram if...
Histograms are among the most popular structures for the succinct summarization of data in a variety...
International audienceWhat advantage do sequential procedures provide over batch algorithms for test...
Given samples from two distributions over an n-element set, we wish to test whether these distributi...
We study the problem of testing discrete distributions with a focus on the high probability regime. ...
International audienceWhat advantage do sequential procedures provide over batch algorithms for test...
Given samples from two distributions over an $n$-element set, we wish to test whether these distribu...
We study the problem of discrete distribution testing in the two-party setting. For example, in the ...
Given samples from two distributions over an n-element set, we wish to test whether these distributi...
Given samples from two distributions over an n-element set, we wish to test whether these distributi...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
We study the question of testing structured properties (classes) of discrete distributions. Specific...
We study the question of testing structured properties (classes) of discrete distributions. Specific...
A discrete distribution p, over [n], is a k histogram if its probability distribution function can b...
We study the question of closeness testing for two discrete distributions. More precisely, given sam...
A probability distribution over an ordered universe [n] = {1,..., n} is said to be a k-histogram if...
Histograms are among the most popular structures for the succinct summarization of data in a variety...
International audienceWhat advantage do sequential procedures provide over batch algorithms for test...
Given samples from two distributions over an n-element set, we wish to test whether these distributi...
We study the problem of testing discrete distributions with a focus on the high probability regime. ...
International audienceWhat advantage do sequential procedures provide over batch algorithms for test...
Given samples from two distributions over an $n$-element set, we wish to test whether these distribu...
We study the problem of discrete distribution testing in the two-party setting. For example, in the ...
Given samples from two distributions over an n-element set, we wish to test whether these distributi...
Given samples from two distributions over an n-element set, we wish to test whether these distributi...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
We study the question of testing structured properties (classes) of discrete distributions. Specific...
We study the question of testing structured properties (classes) of discrete distributions. Specific...