We propose a feature selection approach for clustering which extends Koller and Sahami's mutual-information-based criterion to the unsupervised case. This is achieved with the help of a mixture-based model and the corresponding expectation-maximization algorithm. The result is a backward search scheme, able to sort the features by order of relevance. Finally, an MDL criterion is used to prune the sorted list of features, yielding a feature selection criterion. The proposed approach can be classied as a wrapper, since it wraps the mixture es-timation algorithm in an outer layer that performs feature selection. Preliminary experimental results show that the proposed method has promising performance.
MasterAlternative clustering algorithms target finding alternative groupings of a dataset on which t...
Feature selection is an important step for data mining and machine learning to deal with the curse o...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
In this work the principle of homogeneity between labels and data clusters is exploited in order to ...
Feature selection is an important step for data mining and machine learning. It can be used to reduc...
In this paper, a supervised feature selection approach is presented, which is based on metric applie...
Research on the problem of feature selection for clustering continues to develop. This is a challeng...
In data clustering, the problem of selecting the subset of most relevant features from the data has ...
International audienceSimultaneous selection of the number of clusters and of a relevant subset of f...
This paper introduces concepts and algorithms of feature selection, surveys existing feature selecti...
International audienceThis paper addresses the problem of clustering binary data with feature select...
Feature selection for clustering is difficult because, unlike in supervised learning, there are no c...
Abstract — The major idea of feature selection is to choose a subset of key variables by eliminating...
Abstract. Due to the absence of class labels, unsupervised feature se-lection is much more difficult...
This paper presents a new relevance index based on mutual information that is based on labeled and u...
MasterAlternative clustering algorithms target finding alternative groupings of a dataset on which t...
Feature selection is an important step for data mining and machine learning to deal with the curse o...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
In this work the principle of homogeneity between labels and data clusters is exploited in order to ...
Feature selection is an important step for data mining and machine learning. It can be used to reduc...
In this paper, a supervised feature selection approach is presented, which is based on metric applie...
Research on the problem of feature selection for clustering continues to develop. This is a challeng...
In data clustering, the problem of selecting the subset of most relevant features from the data has ...
International audienceSimultaneous selection of the number of clusters and of a relevant subset of f...
This paper introduces concepts and algorithms of feature selection, surveys existing feature selecti...
International audienceThis paper addresses the problem of clustering binary data with feature select...
Feature selection for clustering is difficult because, unlike in supervised learning, there are no c...
Abstract — The major idea of feature selection is to choose a subset of key variables by eliminating...
Abstract. Due to the absence of class labels, unsupervised feature se-lection is much more difficult...
This paper presents a new relevance index based on mutual information that is based on labeled and u...
MasterAlternative clustering algorithms target finding alternative groupings of a dataset on which t...
Feature selection is an important step for data mining and machine learning to deal with the curse o...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...