We study a generalization of the famous k-center problem where each object is an affine subspace of dimension Δ, and give either the first or significantly improved algorithms and hardness results for many combinations of parameters. This generalization from points (Δ = 0) is motivated by the analysis of incomplete data, a pervasive challenge in statistics: incomplete data objects in ℝd can be modeled as affine subspaces. We give three algorithmic results for different values of k, under the assumption that all subspaces are axis-parallel, the main case of interest because of the correspondence to missing entries in data tables. 1) k = 1: Two polynomial time approximation schemes which runs in poly (Δ, 1/∊)nd. 2) k = 2: O(Δ1/4...
The analysis of incomplete data is a long-standing challenge in practical statistics. When, as is ty...
We present a general approach for designing approximation algorithms for a fundamental class of geom...
We show that k-means clustering is an NP-hard optimization problem, even for instances in the plane....
We study a generalization of the famous k-center problem where each object is an affine subspace of ...
We study a generalization of the famous k-center problem where each object is an affine subspace of ...
In this paper, we present a linear-time approximation scheme for k-means clustering of incomplete da...
We consider a generalization of the fundamental k-means clustering for data with incomplete or corru...
The two most popular unsupervised learning problems are k-Clustering and Low-Rank Approximation. Con...
In this thesis we show that, for several clustering problems, we can extract a small set of points, ...
The Euclidean k-means problem is a classical problem that has been extensively studied in the theore...
Clustering problems often arise in fields like data mining and machine learning. Clustering usually ...
We show that for n points in d-dimensional Euclidean space, a data oblivious random projection of th...
We present algorithms for three geometric problems -- clustering, orienteering, and conflict-free co...
In this paper, we show that for several clustering problems one can extract a small set of points, s...
The analysis of incomplete data is a long-standing challenge in practical statistics. When, as is ty...
The analysis of incomplete data is a long-standing challenge in practical statistics. When, as is ty...
We present a general approach for designing approximation algorithms for a fundamental class of geom...
We show that k-means clustering is an NP-hard optimization problem, even for instances in the plane....
We study a generalization of the famous k-center problem where each object is an affine subspace of ...
We study a generalization of the famous k-center problem where each object is an affine subspace of ...
In this paper, we present a linear-time approximation scheme for k-means clustering of incomplete da...
We consider a generalization of the fundamental k-means clustering for data with incomplete or corru...
The two most popular unsupervised learning problems are k-Clustering and Low-Rank Approximation. Con...
In this thesis we show that, for several clustering problems, we can extract a small set of points, ...
The Euclidean k-means problem is a classical problem that has been extensively studied in the theore...
Clustering problems often arise in fields like data mining and machine learning. Clustering usually ...
We show that for n points in d-dimensional Euclidean space, a data oblivious random projection of th...
We present algorithms for three geometric problems -- clustering, orienteering, and conflict-free co...
In this paper, we show that for several clustering problems one can extract a small set of points, s...
The analysis of incomplete data is a long-standing challenge in practical statistics. When, as is ty...
The analysis of incomplete data is a long-standing challenge in practical statistics. When, as is ty...
We present a general approach for designing approximation algorithms for a fundamental class of geom...
We show that k-means clustering is an NP-hard optimization problem, even for instances in the plane....