We examine feature selection algorithms for handling data sets with many features. We introduce the Random Forest approach and the Scatter Search method as algorithms for selecting relevant features. We then present a general 3-step approach for data sets with thousands of features. Step 1 consists of a Feature Space Reduction process where the majority of irrelevant features are eliminated. This is followed by a Feature Subset Optimization process (Step 2) where we attempt to search for a nearoptimal and small subset in the reduced space. These subsets are then evaluated in Step 3 for their efficiency in classifying the samples in the data. We tested our approach with two gene expression data sets. One of the data sets has 62 samples and 2...
AbstractClassification of gene expression data plays a significant role in prediction and diagnosis ...
Identifying genes linked to the appearance of certain types of cancers and their phenotypes is a wel...
Gene expression data often need to be classified into classes or grouped into clusters for further a...
AbstractClassification of gene expression data plays a significant role in prediction and diagnosis ...
Although there are several causes of cancer, scientists have made a major breakthrough in discoveri...
Although there are several causes of cancer, scientists have made a major breakthrough in discoveri...
Gene expression data is a very complex data set characterised by abundant numbers of features but wi...
Recently, feature selection and dimensionality reduction have become fundamental tools for many data...
AbstractThe DNA microarray technology has capability to determine the levels of thousands of gene si...
Motivation: The increasing use of DNA microarray-based tumor gene expression profiles for cancer dia...
Abstract: In fact, cancer is produced for genetic reasons. So, gene feature selection techniques are...
(Aim) Gene expression data is typically high dimensional with a limited number of samples and contai...
Background: Feature selection is a pattern recognition approach to choose important variables accord...
Machine Learning methods have of late made signicant efforts to solving multidisciplinary problems i...
Machine Learning methods have of late made signicant efforts to solving multidisciplinary problems i...
AbstractClassification of gene expression data plays a significant role in prediction and diagnosis ...
Identifying genes linked to the appearance of certain types of cancers and their phenotypes is a wel...
Gene expression data often need to be classified into classes or grouped into clusters for further a...
AbstractClassification of gene expression data plays a significant role in prediction and diagnosis ...
Although there are several causes of cancer, scientists have made a major breakthrough in discoveri...
Although there are several causes of cancer, scientists have made a major breakthrough in discoveri...
Gene expression data is a very complex data set characterised by abundant numbers of features but wi...
Recently, feature selection and dimensionality reduction have become fundamental tools for many data...
AbstractThe DNA microarray technology has capability to determine the levels of thousands of gene si...
Motivation: The increasing use of DNA microarray-based tumor gene expression profiles for cancer dia...
Abstract: In fact, cancer is produced for genetic reasons. So, gene feature selection techniques are...
(Aim) Gene expression data is typically high dimensional with a limited number of samples and contai...
Background: Feature selection is a pattern recognition approach to choose important variables accord...
Machine Learning methods have of late made signicant efforts to solving multidisciplinary problems i...
Machine Learning methods have of late made signicant efforts to solving multidisciplinary problems i...
AbstractClassification of gene expression data plays a significant role in prediction and diagnosis ...
Identifying genes linked to the appearance of certain types of cancers and their phenotypes is a wel...
Gene expression data often need to be classified into classes or grouped into clusters for further a...