© 2019 Neural information processing systems foundation. All rights reserved. Overlapping clusters are common in models of many practical data-segmentation applications. Suppose we are given n elements to be clustered into k possibly overlapping clusters, and an oracle that can interactively answer queries of the form “do elements u and v belong to the same cluster?” The goal is to recover the clusters with minimum number of such queries. This problem has been of recent interest for the case of disjoint clusters. In this paper, we look at the more practical scenario of overlapping clusters, and provide upper bounds (with algorithms) on the sufficient number of queries. We provide algorithmic results under both arbitrary (worst-case) and sta...
Abstract-One of the fundamental questions in the study of complex networks is community detection, i...
Given a finite ordered set of items and an unknown distinguished subset P of up to p positive elemen...
Data clustering techniques have been applied to extract information from gene expression data for tw...
Given a dataset, traditional clustering algorithms often only provide a single partitioning or a sin...
We study clustering over multiple graphs- each encoding a distinct set of similarity relationships (...
textAnalysis of large collections of data has become inescapable in many areas of scientific and com...
Motivated by many applications, we study clustering with a faulty oracle. In this problem, there are...
External validation indexes allow similarities between two clustering solutions to be quantified. Wi...
Technically, the problem of overlap in a dataset is viewed as an uncertainty problem and is solved ...
Graph clustering is widely-studied unsupervised learning problem in which the task is to group simil...
International audienceIn correlation clustering, we are given $n$ objects together with a binary sim...
Clustering aims to group together data instances which are similar while simultaneously separating t...
This thesis focuses on solving the $K$-means clustering problem approximately with side information ...
Different clustering algorithms have different strengths and weaknesses. Given a dataset and a clus...
Determining the number of clusters is one of the most important topics in cluster analysis. The abil...
Abstract-One of the fundamental questions in the study of complex networks is community detection, i...
Given a finite ordered set of items and an unknown distinguished subset P of up to p positive elemen...
Data clustering techniques have been applied to extract information from gene expression data for tw...
Given a dataset, traditional clustering algorithms often only provide a single partitioning or a sin...
We study clustering over multiple graphs- each encoding a distinct set of similarity relationships (...
textAnalysis of large collections of data has become inescapable in many areas of scientific and com...
Motivated by many applications, we study clustering with a faulty oracle. In this problem, there are...
External validation indexes allow similarities between two clustering solutions to be quantified. Wi...
Technically, the problem of overlap in a dataset is viewed as an uncertainty problem and is solved ...
Graph clustering is widely-studied unsupervised learning problem in which the task is to group simil...
International audienceIn correlation clustering, we are given $n$ objects together with a binary sim...
Clustering aims to group together data instances which are similar while simultaneously separating t...
This thesis focuses on solving the $K$-means clustering problem approximately with side information ...
Different clustering algorithms have different strengths and weaknesses. Given a dataset and a clus...
Determining the number of clusters is one of the most important topics in cluster analysis. The abil...
Abstract-One of the fundamental questions in the study of complex networks is community detection, i...
Given a finite ordered set of items and an unknown distinguished subset P of up to p positive elemen...
Data clustering techniques have been applied to extract information from gene expression data for tw...