Summary: LOVE, a robust, scalable latent model-based clustering method for biological discovery, can be used across a range of datasets to generate both overlapping and non-overlapping clusters. In our formulation, a cluster comprises variables associated with the same latent factor and is determined from an allocation matrix that indexes our latent model. We prove that the allocation matrix and corresponding clusters are uniquely defined. We apply LOVE to biological datasets (gene expression, serological responses measured from HIV controllers and chronic progressors, vaccine-induced humoral immune responses) resulting in meaningful biological output. For all three datasets, the clusters generated by LOVE remain stable across tuning parame...
Publisher's PDF.Estimating the optimal number of clusters is a major challenge in applying cluster a...
Many machine learning problems in biology involve clustering data generated in complex or incomplete...
There is a growing need for unbiased clustering algorithms, ideally automated to analyze complex dat...
Clustering is a long-standing problem in computer science and is applied in virtually any scientific...
Various high-throughput technologies have fueled advances in biomedical research in the last decade....
This thesis explores and evaluates MAXCCLUS, a bioinformatics clustering algorithm, which was design...
In many fields, researchers are confronted by datasets whose variables demonstrate grouping patterns...
Massively high-dimensional datasets are fast becoming commonplace and any advances in the reliable p...
Clustering is the task of organizing a set of objects into meaningful groups. These groups can be di...
While the vast majority of clustering algorithms are partitional, many real world datasets have inhe...
Detecting clusters of data points in physical or high-dimensional (HD) space is a common task in bio...
which permits unrestricted use, distribution, and reproduction in any medium, provided the original ...
Background: The big data moniker is nowhere better deserved than to describe the ever-increasing pro...
Motivation: Over the last decade, a large variety of clustering algorithms have been developed to de...
International audienceThe aim of genetic association studies, and in particular genome-wide associat...
Publisher's PDF.Estimating the optimal number of clusters is a major challenge in applying cluster a...
Many machine learning problems in biology involve clustering data generated in complex or incomplete...
There is a growing need for unbiased clustering algorithms, ideally automated to analyze complex dat...
Clustering is a long-standing problem in computer science and is applied in virtually any scientific...
Various high-throughput technologies have fueled advances in biomedical research in the last decade....
This thesis explores and evaluates MAXCCLUS, a bioinformatics clustering algorithm, which was design...
In many fields, researchers are confronted by datasets whose variables demonstrate grouping patterns...
Massively high-dimensional datasets are fast becoming commonplace and any advances in the reliable p...
Clustering is the task of organizing a set of objects into meaningful groups. These groups can be di...
While the vast majority of clustering algorithms are partitional, many real world datasets have inhe...
Detecting clusters of data points in physical or high-dimensional (HD) space is a common task in bio...
which permits unrestricted use, distribution, and reproduction in any medium, provided the original ...
Background: The big data moniker is nowhere better deserved than to describe the ever-increasing pro...
Motivation: Over the last decade, a large variety of clustering algorithms have been developed to de...
International audienceThe aim of genetic association studies, and in particular genome-wide associat...
Publisher's PDF.Estimating the optimal number of clusters is a major challenge in applying cluster a...
Many machine learning problems in biology involve clustering data generated in complex or incomplete...
There is a growing need for unbiased clustering algorithms, ideally automated to analyze complex dat...