We study the graph clustering problem where each observation (edge or no-edge between a pair of nodes) may have a different level of confi-dence/uncertainty. We propose a clustering al-gorithm that is based on optimizing an appro-priate weighted objective, where larger weights are given to observations with lower uncertainty. Our approach leads to a convex optimization problem that is efficiently solvable. We analyze our approach under a natural generative model, and establish theoretical guarantees for recover-ing the underlying clusters. Our main result is a general theorem that applies to any given weight and distribution for the uncertainty. By optimiz-ing over the weights, we derive a provably opti-mal weighting scheme, which matches t...
A wide range of applications in engineering as well as the natural and social sciences have datasets...
We formulate weighted graph clustering as a prediction problem: given a subset of edge weights we an...
We consider a semi-supervised clustering problem where the locations of the data objects are subject...
An uncertain graph G = (V,E,p) can be viewed as a probability space whose outcomes (referred to as p...
This paper considers the problem of clustering a partially observed unweighted graph—i.e., one where...
We consider the problem of finding clusters in an unweighted graph, when the graph is partially obse...
The problem of finding clusters in a graph arises in several ap-plications such as social networks, ...
The problem of finding clusters in a graph arises in several applications such as social networks, d...
Graph clustering involves the task of partitioning nodes, so that the edge density is higher within ...
Classical clustering algorithms typically either lack an underlying probability framework to make th...
We propose a new data-driven technique for constructing uncertainty sets for robust optimization pro...
In graph theory and network analysis, communities or clusters are sets of nodes in a graph that shar...
As a model problem for clustering, we consider the densest k-disjoint-clique problem of partitioning...
Graph clustering is widely-studied unsupervised learning problem in which the task is to group simil...
We consider a semi-supervised clustering problem where the locations of the data objects are subject...
A wide range of applications in engineering as well as the natural and social sciences have datasets...
We formulate weighted graph clustering as a prediction problem: given a subset of edge weights we an...
We consider a semi-supervised clustering problem where the locations of the data objects are subject...
An uncertain graph G = (V,E,p) can be viewed as a probability space whose outcomes (referred to as p...
This paper considers the problem of clustering a partially observed unweighted graph—i.e., one where...
We consider the problem of finding clusters in an unweighted graph, when the graph is partially obse...
The problem of finding clusters in a graph arises in several ap-plications such as social networks, ...
The problem of finding clusters in a graph arises in several applications such as social networks, d...
Graph clustering involves the task of partitioning nodes, so that the edge density is higher within ...
Classical clustering algorithms typically either lack an underlying probability framework to make th...
We propose a new data-driven technique for constructing uncertainty sets for robust optimization pro...
In graph theory and network analysis, communities or clusters are sets of nodes in a graph that shar...
As a model problem for clustering, we consider the densest k-disjoint-clique problem of partitioning...
Graph clustering is widely-studied unsupervised learning problem in which the task is to group simil...
We consider a semi-supervised clustering problem where the locations of the data objects are subject...
A wide range of applications in engineering as well as the natural and social sciences have datasets...
We formulate weighted graph clustering as a prediction problem: given a subset of edge weights we an...
We consider a semi-supervised clustering problem where the locations of the data objects are subject...