The increased collection of high-dimensional data in various fields has raised a strong interest in clustering algorithms and variable selection procedures. In this disserta- tion, I propose a model-based method that addresses the two problems simultane- ously. I use Dirichlet process mixture models to define the cluster structure and to introduce in the model a latent binary vector to identify discriminating variables. I update the variable selection index using a Metropolis algorithm and obtain inference on the cluster structure via a split-merge Markov chain Monte Carlo technique. I evaluate the method on simulated data and illustrate an application with a DNA microarray study. I also show that the methodology can be adapted to the probl...
The Dirichlet Process (DP) mixture model has become a popular choice for model-based clustering, lar...
Mixture of factor analysers (MFA) is a well-known model that combines the dimensionality reduction t...
The Dirichlet Process (DP) mixture model has become a popular choice for model-based clustering, lar...
<p>Clustering methods are designed to separate heterogeneous data into groups of similar objects suc...
AbstractClustering is one of the most widely used procedures in the analysis of microarray data, for...
Cluster analysis is concerned with partitioning cases into clusters such that the cases in a cluster...
We describe a non-parametric Bayesian model using genotype data to classify individuals among popula...
Statistical analysis of data sets of high-dimensionality has met great interest over the past years,...
The thesis tackles the problem of uncovering hidden structures in high-dimensional data in the prese...
Latent variable models are used extensively in unsupervised learning within the Bayesian paradigm, t...
One of the most important goals of unsupervised learning is to discover meaningful clusters in data....
We have designed and implemented a finite mixture model, using the scaled Dirichlet distribution for...
Following a review of some traditional methods of clustering, we review the Bayesian nonparametric ...
The R package bclust is useful for clustering high-dimensional continuous data. The package uses a p...
Thesis (Ph.D.)--University of Rochester. School of Medicine & Dentistry. Dept. of Biostatistics and ...
The Dirichlet Process (DP) mixture model has become a popular choice for model-based clustering, lar...
Mixture of factor analysers (MFA) is a well-known model that combines the dimensionality reduction t...
The Dirichlet Process (DP) mixture model has become a popular choice for model-based clustering, lar...
<p>Clustering methods are designed to separate heterogeneous data into groups of similar objects suc...
AbstractClustering is one of the most widely used procedures in the analysis of microarray data, for...
Cluster analysis is concerned with partitioning cases into clusters such that the cases in a cluster...
We describe a non-parametric Bayesian model using genotype data to classify individuals among popula...
Statistical analysis of data sets of high-dimensionality has met great interest over the past years,...
The thesis tackles the problem of uncovering hidden structures in high-dimensional data in the prese...
Latent variable models are used extensively in unsupervised learning within the Bayesian paradigm, t...
One of the most important goals of unsupervised learning is to discover meaningful clusters in data....
We have designed and implemented a finite mixture model, using the scaled Dirichlet distribution for...
Following a review of some traditional methods of clustering, we review the Bayesian nonparametric ...
The R package bclust is useful for clustering high-dimensional continuous data. The package uses a p...
Thesis (Ph.D.)--University of Rochester. School of Medicine & Dentistry. Dept. of Biostatistics and ...
The Dirichlet Process (DP) mixture model has become a popular choice for model-based clustering, lar...
Mixture of factor analysers (MFA) is a well-known model that combines the dimensionality reduction t...
The Dirichlet Process (DP) mixture model has become a popular choice for model-based clustering, lar...