Given samples from two distributions over an $n$-element set, we wish to test whether these distributions are statistically close. We present an algorithm which uses a number of independent samples from each distribution that is sublinear in $n$, specifically $O(n^{2/3}\varepsilon^{-8/3}\log n)$, runs in time linear in the sample size, makes no assumptions about the structure of the distributions, and distinguishes the cases when the distance between the distributions is small (less than $\max\{\varepsilon^{4/3}n^{-1/3}/32,\ \varepsilon n^{-1/2}/4\}$) or large (more than $\varepsilon$) in $\ell_1$ distance. This result can be compared to the lower bound of $\Omega(n^{2/3}\varepsilon^{-2/3})$ for this problem given by Valiant [2008]. Our algorithm ha...
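The abstract is cut off before it describes how the algorithm works, so the following is only an illustrative sketch of one standard ingredient of sample-based closeness testing, not the algorithm of the paper: an unbiased, collision-based estimate of the squared $\ell_2$ distance $\|p-q\|_2^2$, computed in time linear in the sample size. The function name `l2_distance_sq_estimate` and the demo parameters are assumptions made for this example. The sketch makes no attempt to handle heavy elements or to convert the $\ell_2$ estimate into the $\ell_1$ guarantee stated above.

```python
import random
from collections import Counter


def l2_distance_sq_estimate(samples_p, samples_q):
    """Collision-based unbiased estimate of ||p - q||_2^2.

    Illustrative helper (hypothetical name), not taken from the paper.
    Uses the identities
      E[# colliding pairs within m samples from p] = C(m, 2) * ||p||_2^2
      E[# colliding pairs across the two samples]  = m_p * m_q * <p, q>
    """
    m_p, m_q = len(samples_p), len(samples_q)
    if m_p < 2 or m_q < 2:
        raise ValueError("need at least two samples from each distribution")

    counts_p = Counter(samples_p)
    counts_q = Counter(samples_q)

    # Pairs of equal samples inside each sample set.
    self_p = sum(c * (c - 1) // 2 for c in counts_p.values())
    self_q = sum(c * (c - 1) // 2 for c in counts_q.values())
    # Pairs (one sample from p, one from q) that landed on the same element.
    cross = sum(c * counts_q[x] for x, c in counts_p.items())

    pairs_p = m_p * (m_p - 1) / 2
    pairs_q = m_q * (m_q - 1) / 2
    # ||p||^2 + ||q||^2 - 2<p, q>; may be slightly negative on small samples.
    return self_p / pairs_p + self_q / pairs_q - 2 * cross / (m_p * m_q)


if __name__ == "__main__":
    rng = random.Random(0)
    n, m = 1000, 5000  # domain size and sample size, chosen arbitrarily
    uniform = [rng.randrange(n) for _ in range(m)]
    half = [rng.randrange(n // 2) for _ in range(m)]  # uniform on half the domain
    print("same distribution:", l2_distance_sq_estimate(uniform, uniform[::-1]))
    print("different distributions:", l2_distance_sq_estimate(uniform, half))
```

A tester built on this statistic would compare the estimate against a threshold depending on $n$ and $\varepsilon$; choosing that threshold and the sample size correctly is the substance of the analysis and is not reproduced here.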
We investigate the problem of testing the equivalence between two discrete histograms. A k-histogram...
Outlier detection is the problem of finding a few different distributions in a set of mostl...
© Copyright 2018 by SIAM. Given samples from an unknown distribution p and a description of a distri...
Given samples from two distributions over an n-element set, we wish to test whether these distributi...
Given samples from two distributions over an $n$-element set, we wish to test whether these distribu...
Given two distributions over an n element set, we wish to check whether these distributions are stat...
We study the question of closeness testing for two discrete distributions. More precisely, given sam...
What advantage do sequential procedures provide over batch algorithms for test...
We consider the problem of testing a basic property of collections of distributions: having similar ...
96 pages. We consider the problem of how to construct algorithms which deal efficiently with large amo...
In this doctoral thesis we consider various property testing problems for structured distributions....