In data clustering, the problem of selecting the subset of most relevant features from the data has been an active research topic. Feature selection for clustering is a challenging task due to the absence of class labels for guiding the search for relevant features. Most methods proposed for this goal are focused on numerical data. In this work, we propose an approach for clustering and selecting categorical features simultaneously. We assume that the data originate from a finite mixture of multinomial distributions and implement an integrated expectation-maximization (EM) algorithm that estimates all the parameters of the model and selects the subset of relevant features simultaneously. The results obtained on synthetic data illustrate the...
In this study, we consider unsupervised clustering of categorical vectors that can be of different s...
We develop new algorithmic methods with provable guarantees for feature selection in regard to categ...
Abstract — The major idea of feature selection is to choose a subset of key variables by eliminating...
Research on the problem of feature selection for clustering continues to develop. This is a challeng...
Research on cluster analysis for categorical data continues to develop, new clustering algorithms be...
We consider the problem of variable or feature selection for model-based clustering. The problem of ...
Cluster analysis for categorical data has been an active area of research. A well-known problem in ...
International audienceThis paper addresses the problem of clustering binary data with feature select...
In clustering, one may be interested in the classification of similar objects into groups, and one m...
Data mining, the extraction of hidden predictive information from large databases, is a powerful new...
Variable selection for cluster analysis is a difficult problem. The difficulty originates not only f...
This paper introduces concepts and algorithms of feature selection, surveys existing feature selecti...
In data mining, clustering analysis is an important research area. The goal of clustering is to grou...
We propose a feature selection approach for clustering which extends Koller and Sahami's mutual...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
In this study, we consider unsupervised clustering of categorical vectors that can be of different s...
We develop new algorithmic methods with provable guarantees for feature selection in regard to categ...
Abstract — The major idea of feature selection is to choose a subset of key variables by eliminating...
Research on the problem of feature selection for clustering continues to develop. This is a challeng...
Research on cluster analysis for categorical data continues to develop, new clustering algorithms be...
We consider the problem of variable or feature selection for model-based clustering. The problem of ...
Cluster analysis for categorical data has been an active area of research. A well-known problem in ...
International audienceThis paper addresses the problem of clustering binary data with feature select...
In clustering, one may be interested in the classification of similar objects into groups, and one m...
Data mining, the extraction of hidden predictive information from large databases, is a powerful new...
Variable selection for cluster analysis is a difficult problem. The difficulty originates not only f...
This paper introduces concepts and algorithms of feature selection, surveys existing feature selecti...
In data mining, clustering analysis is an important research area. The goal of clustering is to grou...
We propose a feature selection approach for clustering which extends Koller and Sahami's mutual...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
In this study, we consider unsupervised clustering of categorical vectors that can be of different s...
We develop new algorithmic methods with provable guarantees for feature selection in regard to categ...
Abstract — The major idea of feature selection is to choose a subset of key variables by eliminating...