We consider the problem of variable or feature selection for model-based clustering. The problem of comparing two nested subsets of variables is recast as a model comparison problem and addressed using approximate Bayes factors. A greedy search algorithm is proposed for finding a local optimum in model space. The resulting method selects variables (or features), the number of clusters, and the clustering model simultaneously. We applied the method to several simulated and real examples and found that removing irrelevant variables often improved performance. Compared with methods based on all of the variables, our variable selection method consistently yielded more accurate estimates of the number of groups and lower classification error rat...
International audienceSeveral methods for variable selection have been proposed in model-based clust...
International audienceSeveral methods for variable selection have been proposed in model-based clust...
Model-based clustering is a popular approach for clustering multivariate data which has seen applica...
We compare two major approaches to variable selection in clustering: model selection and regularizat...
We compare two major approaches to variable selection in clustering: model selection and regularizat...
We compare two major approaches to variable selection in clustering: model selection and regularizat...
We compare two major approaches to variable selection in clustering: model selection and regularizat...
Variable selection for cluster analysis is a difficult problem. The difficulty originates not only f...
The currently available variable selection procedures in model-based clustering assume that the irre...
The currently available variable selection procedures in model-based clustering assume that the irre...
International audienceWe compare two major approaches to variable selection in clustering: model sel...
International audienceWe compare two major approaches to variable selection in clustering: model sel...
Thesis (Ph.D.)--University of Rochester. School of Medicine & Dentistry. Dept. of Biostatistics and ...
We propose a method for selecting variables in latent class analysis, which is the most common model...
Model-based clustering is a popular approach for clustering multivariate data which has seen applica...
International audienceSeveral methods for variable selection have been proposed in model-based clust...
International audienceSeveral methods for variable selection have been proposed in model-based clust...
Model-based clustering is a popular approach for clustering multivariate data which has seen applica...
We compare two major approaches to variable selection in clustering: model selection and regularizat...
We compare two major approaches to variable selection in clustering: model selection and regularizat...
We compare two major approaches to variable selection in clustering: model selection and regularizat...
We compare two major approaches to variable selection in clustering: model selection and regularizat...
Variable selection for cluster analysis is a difficult problem. The difficulty originates not only f...
The currently available variable selection procedures in model-based clustering assume that the irre...
The currently available variable selection procedures in model-based clustering assume that the irre...
International audienceWe compare two major approaches to variable selection in clustering: model sel...
International audienceWe compare two major approaches to variable selection in clustering: model sel...
Thesis (Ph.D.)--University of Rochester. School of Medicine & Dentistry. Dept. of Biostatistics and ...
We propose a method for selecting variables in latent class analysis, which is the most common model...
Model-based clustering is a popular approach for clustering multivariate data which has seen applica...
International audienceSeveral methods for variable selection have been proposed in model-based clust...
International audienceSeveral methods for variable selection have been proposed in model-based clust...
Model-based clustering is a popular approach for clustering multivariate data which has seen applica...