Coresets for Nonparametric Estimation — the Case of DP-Means

Olivier Bachem
Mario Lucic
Andreas Krause

Publication date

October 2016

Abstract

Scalable training of Bayesian nonparametric models is a notoriously difficult challenge. We explore the use of coresets – a data summariza-tion technique originating from computational geometry – for this task. Coresets are weighted subsets of the data such that models trained on these coresets are provably competitive with models trained on the full dataset. Coresets sublinear in the dataset size allow for fast approximate inference with provable guarantees. Existing constructions, however, are limited to parametric problems. Using novel techniques in coreset construction we show the existence of coresets for DP-Means – a prototypical nonparametric clustering problem – and provide a practical construction algorithm. We empiri-cally demonst...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Coresets for Nonparametric Estimation — the Case of DP-Means

Abstract

Extracted data

Coresets for Nonparametric Estimation — the Case of DP-Means

Abstract

Extracted data

Related items

Related items