This dissertation takes a relationship-based approach to cluster analysis of high (1000 and more) dimensional data that side-steps the ‘curse of dimensionality’ issue by working in a suitable similarity space instead of the original feature space. We propose two frameworks that leverage graph algorithms to achieve relationship-based clustering and visualization, respectively. In the visualization framework, the output from the clustering algorithm is used to reorder the data points so that the resulting permuted similarity matrix can be readily visualized in 2 dimensions, with clusters showing up as bands. Results on retail transaction, document (bag-of-words), and web-log data show that our approach can yield superior results while ...
Due to the technological progress over the last decades, today’s scientific and commercial applicati...
This dissertation focuses on the topic of relational data clustering, which is the task of organizin...
We study clustering over multiple graphs- each encoding a distinct set of similarity relationships (...
This dissertation takes a relationship-based approach to cluster analysis of high (1000 and more) d...
In this study, we propose a modified version of relationship based clustering framework dealing with...
In this study, we propose a modified version of relationship based clustering framework dealing with...
The purpose of this thesis is to present our research works on some of the fundamental issues encoun...
This paper studies cluster ensembles for high dimensional data clustering. We examine three differen...
We performed an investigation of how several data relationship discovery algorithms can be combined ...
In this paper, we introduce a novel similarity measure for relational data. It is the first measure ...
The goal of this study was to develop an efficient clustering framework for processing high-dimensio...
Abstract---- Clustering is process for finding similarity groups in data. It is considered as unsupe...
Clustering is an underspecified task: there are no universal criteria for what makes a good clusteri...
Clustering is an essential data mining task with numerous applications. Clustering is the process of...
This thesis studies two unsupervised pattern discovery problems within the context of scientific app...
Due to the technological progress over the last decades, today’s scientific and commercial applicati...
This dissertation focuses on the topic of relational data clustering, which is the task of organizin...
We study clustering over multiple graphs- each encoding a distinct set of similarity relationships (...
This dissertation takes a relationship-based approach to cluster analysis of high (1000 and more) d...
In this study, we propose a modified version of relationship based clustering framework dealing with...
In this study, we propose a modified version of relationship based clustering framework dealing with...
The purpose of this thesis is to present our research works on some of the fundamental issues encoun...
This paper studies cluster ensembles for high dimensional data clustering. We examine three differen...
We performed an investigation of how several data relationship discovery algorithms can be combined ...
In this paper, we introduce a novel similarity measure for relational data. It is the first measure ...
The goal of this study was to develop an efficient clustering framework for processing high-dimensio...
Abstract---- Clustering is process for finding similarity groups in data. It is considered as unsupe...
Clustering is an underspecified task: there are no universal criteria for what makes a good clusteri...
Clustering is an essential data mining task with numerous applications. Clustering is the process of...
This thesis studies two unsupervised pattern discovery problems within the context of scientific app...
Due to the technological progress over the last decades, today’s scientific and commercial applicati...
This dissertation focuses on the topic of relational data clustering, which is the task of organizin...
We study clustering over multiple graphs- each encoding a distinct set of similarity relationships (...