We devise coresets for kernel $k$-Means with a general kernel, and use them to obtain new, more efficient, algorithms. Kernel $k$-Means has superior clustering capability compared to classical $k$-Means, particularly when clusters are non-linearly separable, but it also introduces significant computational challenges. We address this computational issue by constructing a coreset, which is a reduced dataset that accurately preserves the clustering costs. Our main result is a coreset for kernel $k$-Means that works for a general kernel and has size $\mathrm{poly}(k\epsilon^{-1})$. Our new coreset both generalizes and greatly improves all previous results; moreover, it can be constructed in time near-linear in $n$. This result immediately im...
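To make the quantity a coreset must preserve concrete, here is a minimal sketch of the kernel $k$-Means cost of a weighted point set, computed only through kernel evaluations. It assumes the standard formulation, which the abstract does not spell out: each center lies in the feature space and is written as a linear combination $\sum_j a_j \varphi(y_j)$ of mapped points, so that $\|\varphi(x) - c\|^2 = K(x,x) - 2\sum_j a_j K(x,y_j) + \sum_{j,l} a_j a_l K(y_j,y_l)$. The function names, the center representation, and the two example kernels are illustrative choices, not the paper's construction.

```python
import math

def linear_kernel(x, y):
    # K(x, y) = <x, y>; with this kernel the cost is the classical k-Means cost.
    return sum(a * b for a, b in zip(x, y))

def gaussian_kernel(x, y, sigma=1.0):
    # K(x, y) = exp(-||x - y||^2 / (2 sigma^2)); sigma is an illustrative parameter.
    sq = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq / (2.0 * sigma ** 2))

def sq_dist_to_center(x, center, kernel):
    # center is a list of (coefficient, point) pairs standing for
    # sum_j a_j * phi(y_j) in the feature space (assumed representation).
    cross = sum(a * kernel(x, y) for a, y in center)
    norm = sum(a * b * kernel(y, z) for a, y in center for b, z in center)
    return kernel(x, x) - 2.0 * cross + norm

def kernel_kmeans_cost(points, weights, centers, kernel):
    # Weighted sum, over all points, of the squared feature-space distance
    # to the nearest center.
    return sum(
        w * min(sq_dist_to_center(x, c, kernel) for c in centers)
        for x, w in zip(points, weights)
    )

# Two points on a line and one center at their feature-space mean.
X = [(0.0,), (2.0,)]
mean_center = [(0.5, (0.0,)), (0.5, (2.0,))]
cost = kernel_kmeans_cost(X, [1.0, 1.0], [mean_center], linear_kernel)
```

Under this formulation, a weighted subset $S$ would be an $\epsilon$-coreset of $X$ if calling `kernel_kmeans_cost` on $S$ with its weights and on $X$ with unit weights gave values within a factor $1 \pm \epsilon$ of each other for every choice of $k$ centers.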
Kernel k-means is useful for performing clustering on nonlinearly separable data. The kernel k-means...
We study approximation algorithms for k-median clustering. We obtain small coresets for k-median clu...
Coresets are among the most popular paradigms for summarizing data. In particular, there exist many ...
The k-means problem seeks a clustering that minimizes the sum of squared errors cost function: For i...
Kernel Methods are algorithms that implicitly perform a nonlinear mapping of the input data to a hig...
Kernel k-Means is a basis for many state of the art global clustering approaches. When the n...
The k'-means algorithm is a new improvement of the k-means algorithm. It implements a rewarding and pen...
Multiple kernel $k$-means (MKKM) aims to improve clustering performance by learning an optimal kerne...
This paper poses the question of whether or not the usage of the kernel trick is justified. We inves...
The fuzzy K-means problem is a popular generalization of the well-known K-means problem to soft clus...
We propose a novel clustering technique based on kernel methods. We exploit the geometric properties...
In this paper, we show that there exists a (k, ε)-coreset for k-median and k-means clustering of n p...
The kernel k-means is an effective method for data clustering which extends the commonly-used k-mea...
We present a novel method to learn arbitrary cluster boundaries by extending the k-means algorithm...
By always mapping data from lower dimensional space into higher or even infinite dimensional space,...