The Dirichlet Process (DP) mixture model has become a popular choice for model-based clustering, largely because it allows the number of clusters to be inferred. The sequential updating and greedy search (SUGS) algorithm (Wang & Dunson, 2011) was proposed as a fast method for performing approximate Bayesian inference in DP mixture models, by posing clustering as a Bayesian model selection (BMS) problem and avoiding the use of computationally costly Markov chain Monte Carlo methods. Here we consider how this approach may be extended to permit variable selection for clustering, and also demonstrate the benefits of Bayesian model averaging (BMA) in place of BMS. Through an array of simulation examples and well-studied examples from cancer tran...
International audienceFlow cytometry is a high-throughput technology used to quantify multiple surfa...
39 pages, 11 figuresInternational audienceFlow cytometry is a high-throughput technology used to qua...
Cluster analysis is concerned with partitioning cases into clusters such that the cases in a cluster...
The Dirichlet Process (DP) mixture model has become a popular choice for model-based clustering, lar...
In this article we propose an improvement on the sequential updating and greedy search (SUGS) algori...
Summary There has been increasing interest in applying Bayesian nonparametric methods in large sampl...
We present fastbaps, a fast solution to the genetic clustering problem. Fastbaps rapidly identifies ...
AbstractClustering is one of the most widely used procedures in the analysis of microarray data, for...
The Dirichlet process mixture model (DPMM) is a ubiquitous, flexible Bayesian nonparametric statisti...
The Dirichlet process mixture (DPM) is a ubiquitous, flexible Bayesian nonparametric statistical mod...
The K-means algorithm is one of the most popular clustering algorithms in current use as it is relat...
Cancers from different tissue types can share a latent structure reflecting commonly altered gene pa...
Although the use of clustering methods has rapidly become one of the standard computational approach...
We present fastbaps, a fast solution to the genetic clustering problem. Fastbaps rapidly identifies ...
Although the use of clustering methods has rapidly become one of the standard computational approach...
International audienceFlow cytometry is a high-throughput technology used to quantify multiple surfa...
39 pages, 11 figuresInternational audienceFlow cytometry is a high-throughput technology used to qua...
Cluster analysis is concerned with partitioning cases into clusters such that the cases in a cluster...
The Dirichlet Process (DP) mixture model has become a popular choice for model-based clustering, lar...
In this article we propose an improvement on the sequential updating and greedy search (SUGS) algori...
Summary There has been increasing interest in applying Bayesian nonparametric methods in large sampl...
We present fastbaps, a fast solution to the genetic clustering problem. Fastbaps rapidly identifies ...
AbstractClustering is one of the most widely used procedures in the analysis of microarray data, for...
The Dirichlet process mixture model (DPMM) is a ubiquitous, flexible Bayesian nonparametric statisti...
The Dirichlet process mixture (DPM) is a ubiquitous, flexible Bayesian nonparametric statistical mod...
The K-means algorithm is one of the most popular clustering algorithms in current use as it is relat...
Cancers from different tissue types can share a latent structure reflecting commonly altered gene pa...
Although the use of clustering methods has rapidly become one of the standard computational approach...
We present fastbaps, a fast solution to the genetic clustering problem. Fastbaps rapidly identifies ...
Although the use of clustering methods has rapidly become one of the standard computational approach...
International audienceFlow cytometry is a high-throughput technology used to quantify multiple surfa...
39 pages, 11 figuresInternational audienceFlow cytometry is a high-throughput technology used to qua...
Cluster analysis is concerned with partitioning cases into clusters such that the cases in a cluster...