Thesis (Ph.D.)--Boston UniversityIn machine learning, the problem of unsupervised learning is that of trying to explain key features and find hidden structures in unlabeled data. In this thesis we focus on three unsupervised learning scenarios: graph based clustering with imbalanced data, point-wise anomaly detection and anomalous cluster detection on graphs. In the first part we study spectral clustering, a popular graph based clustering technique. We investigate the reason why spectral clustering performs badly on imbalanced and proximal data. We then propose the partition constrained minimum cut (PCut) framework based on a novel parametric graph construction method, that is shown to adapt to different degrees of imbalanced data. We anal...
This course project provide the basic theory of spectral clustering from a graph partitioning point ...
Clustering is used in identifying groups of samples with similar properties, and it is one of the mo...
Abstract. We present a set of clustering algorithms that identify cluster boundaries by searching fo...
Anomaly detection is a data partitioning algorithm which separates outliers from normative data poin...
This book summarizes the state-of-the-art in unsupervised learning. The contributors discuss how wit...
Spectral clustering methods that are frequently used in clustering and community detection applicati...
In this study, we propose a better relationship based clustering framework for dealing with unbalanc...
Finally, we study how to construct an appropriate graph for spectral clustering. Given a local simil...
International audienceIn this article, we propose a semi-supervised version of spectral clustering, ...
Clustering analysis is one of the main tools for exploratory data analysis, with applications from s...
© 2012 IEEE. Spectral clustering (SC) has been proven to be effective in various applications. Howev...
The demand for analyzing patterns and structures of data is growing dramatically in recent years. Th...
Fast and eective unsupervised clustering is a fundamental tool in unsupervised learning. Here is a n...
High-dimensional data are becoming increasingly pervasive, and bring new problems and opportunities ...
Clustering is a central topic in unsupervised learning and has a wide variety of applications. Howev...
This course project provide the basic theory of spectral clustering from a graph partitioning point ...
Clustering is used in identifying groups of samples with similar properties, and it is one of the mo...
Abstract. We present a set of clustering algorithms that identify cluster boundaries by searching fo...
Anomaly detection is a data partitioning algorithm which separates outliers from normative data poin...
This book summarizes the state-of-the-art in unsupervised learning. The contributors discuss how wit...
Spectral clustering methods that are frequently used in clustering and community detection applicati...
In this study, we propose a better relationship based clustering framework for dealing with unbalanc...
Finally, we study how to construct an appropriate graph for spectral clustering. Given a local simil...
International audienceIn this article, we propose a semi-supervised version of spectral clustering, ...
Clustering analysis is one of the main tools for exploratory data analysis, with applications from s...
© 2012 IEEE. Spectral clustering (SC) has been proven to be effective in various applications. Howev...
The demand for analyzing patterns and structures of data is growing dramatically in recent years. Th...
Fast and eective unsupervised clustering is a fundamental tool in unsupervised learning. Here is a n...
High-dimensional data are becoming increasingly pervasive, and bring new problems and opportunities ...
Clustering is a central topic in unsupervised learning and has a wide variety of applications. Howev...
This course project provide the basic theory of spectral clustering from a graph partitioning point ...
Clustering is used in identifying groups of samples with similar properties, and it is one of the mo...
Abstract. We present a set of clustering algorithms that identify cluster boundaries by searching fo...