Approximating distributions from their samples is a canonical statistical-learning problem. One of its most powerful and successful modalities approximates every distribution to an $\ell_1$ distance essentially at most a constant times larger than its closest $t$-piece degree-$d$ polynomial, where $t\ge1$ and $d\ge0$. Letting $c_{t,d}$ denote the smallest such factor, clearly $c_{1,0}=1$, and it can be shown that $c_{t,d}\ge 2$ for all other $t$ and $d$. Yet current computationally efficient algorithms show only $c_{t,1}\le 2.25$ and the bound rises quickly to $c_{t,d}\le 3$ for $d\ge 9$. We derive a near-linear-time and essentially sample-optimal estimator that establishes $c_{t,d}=2$ for all $(t,d)\ne(1,0)$. Additionally, for many practic...
For some $\epsilon > 10^{-36}$ we give a $3/2-\epsilon$ approximation algorithm for metric TSP
We give highly efficient algorithms, and almost matching lower bounds, for a range of basic statisti...
International audienceWhat advantage do sequential procedures provide over batch algorithms for test...
We design a new, fast algorithm for agnostically learning univariate probability distributions whose...
Total variation distance (TV distance) is a fundamental notion of distance between probability distr...
We study the question of closeness testing for two discrete distributions. More precisely, given sam...
Modern data science calls for statistical inference algorithms that are both data-efficient and comp...
This paper )+O(1) Non-explicit [10,9] )+O(1) Lower bound [6, 9] 2. Preliminaries 2.1...
Estimating distributions over large alphabets is a fundamental machine-learning tenet. Yet no method...
We give highly efficient algorithms, and almost matching lower bounds, for a range of basic statisti...
Motivated by the recent empirical successes of deep generative models, we study the computational co...
We establish optimal Statistical Query (SQ) lower bounds for robustly learning certain families of d...
Given some arbitrary distribution D over {0, 1}n and arbitrary target function c∗, the problem of ag...
Let p be an unknown and arbitrary probability distribution over [0, 1). We con-sider the problem of ...
Consider the problem of nonparametric estimation of an unknown $\beta$-H\"older smooth density $p_{X...
For some $\epsilon > 10^{-36}$ we give a $3/2-\epsilon$ approximation algorithm for metric TSP
We give highly efficient algorithms, and almost matching lower bounds, for a range of basic statisti...
International audienceWhat advantage do sequential procedures provide over batch algorithms for test...
We design a new, fast algorithm for agnostically learning univariate probability distributions whose...
Total variation distance (TV distance) is a fundamental notion of distance between probability distr...
We study the question of closeness testing for two discrete distributions. More precisely, given sam...
Modern data science calls for statistical inference algorithms that are both data-efficient and comp...
This paper )+O(1) Non-explicit [10,9] )+O(1) Lower bound [6, 9] 2. Preliminaries 2.1...
Estimating distributions over large alphabets is a fundamental machine-learning tenet. Yet no method...
We give highly efficient algorithms, and almost matching lower bounds, for a range of basic statisti...
Motivated by the recent empirical successes of deep generative models, we study the computational co...
We establish optimal Statistical Query (SQ) lower bounds for robustly learning certain families of d...
Given some arbitrary distribution D over {0, 1}n and arbitrary target function c∗, the problem of ag...
Let p be an unknown and arbitrary probability distribution over [0, 1). We con-sider the problem of ...
Consider the problem of nonparametric estimation of an unknown $\beta$-H\"older smooth density $p_{X...
For some $\epsilon > 10^{-36}$ we give a $3/2-\epsilon$ approximation algorithm for metric TSP
We give highly efficient algorithms, and almost matching lower bounds, for a range of basic statisti...
International audienceWhat advantage do sequential procedures provide over batch algorithms for test...